Image-text matching

Image-Text Matching is a subtask within Cross-Modal Retrieval (CMR) that involves establishing associations between images and corresponding textual descriptions. The goal is to retrieve an image given a textual query or, conversely, retrieve a textual description given an image query. This task is challenging due to the heterogeneity gap between image and text data representations. Image-text matching is used in applications such as content-based image search, visual question answering, and multimodal summarization.

Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11–20 of 188 papers

Title	Date	Tasks	Status	Hype
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP	Mar 5, 2025	Adversarial RobustnessImage-text matching	CodeCode Available	1
IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis	Mar 2, 2025	Image SegmentationImage-text matching	CodeCode Available	1
ReCon: Enhancing True Correspondence Discrimination through Relation Consistency for Robust Noisy Correspondence Learning	Feb 27, 2025	Cross-Modal RetrievalCross-modal retrieval with noisy correspondence	CodeCode Available	1
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation	Feb 27, 2025	Image-text matchingObject	CodeCode Available	1
Image-text matching for large-scale book collections	Jul 29, 2024	Image-text matchingOptical Character Recognition (OCR)	CodeCode Available	1
UGNCL: Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching	Jul 11, 2024	Cross-Modal RetrievalCross-modal retrieval with noisy correspondence	CodeCode Available	1
Composing Object Relations and Attributes for Image-Text Matching	Jun 17, 2024	AttributeGraph Attention	CodeCode Available	1
Deep Boosting Learning: A Brand-new Cooperative Approach for Image-Text Matching	Apr 28, 2024	Contrastive LearningImage-text matching	CodeCode Available	1
RadCLIP: Enhancing Radiologic Image Analysis through Contrastive Language-Image Pre-training	Mar 15, 2024	Diagnosticimage-classification	CodeCode Available	1
ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation	Feb 7, 2024	Image GenerationImage-text matching	CodeCode Available	1

Show:10 25 50

← PrevPage 2 of 19Next →

No leaderboard results yet.