Comparative analysis on cross-modal information retrieval: A review

Parminder Kaur*, Husanbir Singh Pannu, Avleen Kaur Malhi

*Corresponding author for this work

Research output: Contribution to journalReview Articlepeer-review

23 Citations (Scopus)


Human beings experience life through a spectrum of modes such as vision, taste, hearing, smell, and touch. These multiple modes are integrated for information processing in our brain using a complex network of neuron connections. Likewise for artificial intelligence to mimic the human way of learning and evolve into the next generation, it should elucidate multi-modal information fusion efficiently. Modality is a channel that conveys information about an object or an event such as image, text, video, and audio. A research problem is said to be multi-modal when it incorporates information from more than a single modality. Multi-modal systems involve one mode of data to be inquired for any (same or varying) modality outcome whereas cross-modal system strictly retrieves the information from a dissimilar modality. As the input-output queries belong to diverse modal families, their coherent comparison is still an open challenge with their primitive forms and subjective definition of content similarity. Numerous techniques have been proposed by researchers to handle this issue and to reduce the semantic gap of information retrieval among different modalities. This paper focuses on a comparative analysis of various research works in the field of cross-modal information retrieval. Comparative analysis of several cross-modal representations and the results of the state-of-the-art methods when applied on benchmark datasets have also been discussed. In the end, open issues are presented to enable the researchers to a better understanding of the present scenario and to identify future research directions.

Original languageEnglish
Article number100336
Number of pages36
JournalComputer Science Review
Publication statusPublished - Feb 2021
MoE publication typeA2 Review article in a scientific journal


  • Comparative analysis
  • Cross-modal
  • Data fusion
  • Information retrieval
  • Multimedia


Dive into the research topics of 'Comparative analysis on cross-modal information retrieval: A review'. Together they form a unique fingerprint.

Cite this