
Image-text matching

Image-Text Matching is a subtask within Cross-Modal Retrieval (CMR) that involves establishing associations between images and corresponding textual descriptions. The goal is to retrieve an image given a textual query or, conversely, retrieve a textual description given an image query. This task is challenging due to the heterogeneity gap between image and text data representations. Image-text matching is used in applications such as content-based image search, visual question answering, and multimodal summarization.
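The two retrieval directions described above can be illustrated with a minimal sketch. It assumes that images and texts have already been mapped into a shared embedding space by some encoder (not shown here) and that matching pairs share the same index. The toy vectors, the function names `cosine`, `rank`, and `recall_at_k`, and the choice of cosine similarity are all illustrative assumptions, not the method of any paper listed on this page.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def rank(query, candidates):
    """Return candidate indices ordered from most to least similar to the query."""
    scores = [cosine(query, c) for c in candidates]
    return sorted(range(len(candidates)), key=lambda i: scores[i], reverse=True)

def recall_at_k(queries, candidates, k):
    """Fraction of queries whose paired candidate (same index) appears in the top k."""
    hits = sum(1 for i, q in enumerate(queries) if i in rank(q, candidates)[:k])
    return hits / len(queries)

# Hypothetical toy embeddings: image i is paired with text i.
image_embs = [[1.0, 0.1, 0.0], [0.0, 1.0, 0.2], [0.1, 0.0, 1.0]]
text_embs = [[0.9, 0.2, 0.1], [0.1, 0.8, 0.3], [0.6, 0.1, 0.7]]

# Text-to-image retrieval: rank all images for one text query.
text_to_image = rank(text_embs[2], image_embs)

# Image-to-text retrieval: rank all texts for one image query.
image_to_text = rank(image_embs[0], text_embs)

print("text 2 -> images:", text_to_image)
print("image 0 -> texts:", image_to_text)
print("text-to-image Recall@1:", recall_at_k(text_embs, image_embs, 1))
```

In practice the embeddings would come from trained image and text encoders, and the heterogeneity gap mentioned above is what makes learning such a shared space difficult.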

Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective

Papers


Showing 131–140 of 188 papers

Title	Date	Tasks	Status	Hype
Probing the Role of Positional Information in Vision-Language Models	Jan 16, 2022	Contrastive Learning, Image-text matching	Unverified	0
Negative-Aware Attention Framework for Image-Text Matching	Jan 1, 2022	Image-text matching, Text Matching	Code Available	1
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation	Dec 10, 2021	Image-text matching, Image-text Retrieval	Unverified	0
Embedding Arithmetic of Multimodal Queries for Image Retrieval	Dec 6, 2021	Image Retrieval, Image-text matching	Unverified	0
DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting	Dec 2, 2021	Image-text matching, Instance Segmentation	Code Available	1
Learning with Noisy Correspondence for Cross-modal Matching	Dec 1, 2021	Cross-Modal Retrieval, Cross-modal retrieval with noisy correspondence	Code Available	1
UFO: A UniFied TransfOrmer for Vision-Language Representation Learning	Nov 19, 2021	Image Captioning, Image-text matching	Unverified	0
More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching	Nov 16, 2021	Contrastive Learning, Image-text matching	Unverified	0
MURAL: Multimodal, Multitask Representations Across Languages	Nov 1, 2021	Cross-Modal Retrieval, Image-text matching	Unverified	0
Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching	Oct 6, 2021	Image Captioning, Image-text matching	Unverified	0



No leaderboard results yet.