SOTAVerified

Image-text matching

Image-Text Matching is a subtask within Cross-Modal Retrieval (CMR) that involves establishing associations between images and corresponding textual descriptions. The goal is to retrieve an image given a textual query or, conversely, retrieve a textual description given an image query. This task is challenging due to the heterogeneity gap between image and text data representations. Image-text matching is used in applications such as content-based image search, visual question answering, and multimodal summarization.
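The retrieval described above is commonly done by embedding images and texts into a shared vector space and ranking candidates by similarity. The sketch below is a minimal, hypothetical illustration of that idea using toy embedding vectors (standing in for the outputs of real image and text encoders) and cosine similarity; it is not any specific model's implementation.

```python
import numpy as np

def cosine_sim(a, b):
    # Row-wise cosine similarity between two batches of embeddings.
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    b = b / np.linalg.norm(b, axis=1, keepdims=True)
    return a @ b.T

# Toy embeddings standing in for encoder outputs (hypothetical values).
image_embeddings = np.array([[1.0, 0.0],    # image 0
                             [0.0, 1.0]])   # image 1
text_embeddings = np.array([[0.9, 0.1]])    # one text query

# Text-to-image retrieval: rank images by similarity to the query.
sims = cosine_sim(text_embeddings, image_embeddings)
best_image = int(np.argmax(sims, axis=1)[0])
print(best_image)  # index of the best-matching image
```

Image-to-text retrieval is symmetric: transpose the similarity matrix and rank texts per image. Real systems differ mainly in how the embeddings are produced (e.g., cross-attention vs. independent encoders) and in the training objective that aligns the two modalities.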

Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective

Papers

Showing 176-188 of 188 papers

Title | Status | Hype
Visual Semantic Reasoning for Image-Text Matching | Code | 1
VL-BERT: Pre-training of Generic Visual-Linguistic Representations | Code | 1
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training | - | 0
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking | Code | 0
Knowledge Aware Semantic Concept Expansion for Image-Text Matching | - | 0
Position Focused Attention Network for Image-Text Matching | Code | 0
ParNet: Position-aware Aggregated Relation Network for Image-Text matching | - | 0
Deep Cross-Modal Projection Learning for Image-Text Matching | Code | 0
Stacked Cross Attention for Image-Text Matching | Code | 1
AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks | Code | 1
Cross-modal Subspace Learning for Fine-grained Sketch-based Image Retrieval | - | 0
Learning Two-Branch Neural Networks for Image-Text Matching Tasks | Code | 0
Dual Attention Networks for Multimodal Reasoning and Matching | Code | 0

No leaderboard results yet.