SOTAVerified|Agents Browse Leaderboard About

Image-text matching

Image-Text Matching is a subtask within Cross-Modal Retrieval (CMR) that involves establishing associations between images and corresponding textual descriptions. The goal is to retrieve an image given a textual query or, conversely, retrieve a textual description given an image query. This task is challenging due to the heterogeneity gap between image and text data representations. Image-text matching is used in applications such as content-based image search, visual question answering, and multimodal summarization.

Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 71–80 of 188 papers

Title	Date	Tasks	Status	Hype
More Grounded Image Captioning by Distilling Image-Text Matching Model	Apr 1, 2020	Image CaptioningImage-text matching	CodeCode Available	1
Adaptive Offline Quintuplet Loss for Image-Text Matching	Mar 7, 2020	Image-text matchingText Matching	CodeCode Available	1
UNITER: UNiversal Image-TExt Representation Learning	Sep 25, 2019	Image-text matchingImage-text Retrieval	CodeCode Available	1
Visual Semantic Reasoning for Image-Text Matching	Sep 6, 2019	Cross-Modal RetrievalImage Retrieval	CodeCode Available	1
VL-BERT: Pre-training of Generic Visual-Linguistic Representations	Aug 22, 2019	Image-text matchingLanguage Modelling	CodeCode Available	1
Stacked Cross Attention for Image-Text Matching	Mar 21, 2018	Cross-Modal RetrievalImage Retrieval	CodeCode Available	1
AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks	Nov 28, 2017	Generative Adversarial NetworkImage Generation	CodeCode Available	1
TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP	May 24, 2025	Image CaptioningImage Generation	—Unverified	0
Descriptive Image-Text Matching with Graded Contextual Similarity	May 15, 2025	DescriptiveImage-text matching	—Unverified	0
Compositional Image-Text Matching and Retrieval by Grounding Entities	May 4, 2025	Image CaptioningImage-text matching	CodeCode Available	0

Show:10 25 50

← PrevPage 8 of 19Next →

No leaderboard results yet.