
Image-text matching

Image-Text Matching is a subtask within Cross-Modal Retrieval (CMR) that involves establishing associations between images and corresponding textual descriptions. The goal is to retrieve an image given a textual query or, conversely, retrieve a textual description given an image query. This task is challenging due to the heterogeneity gap between image and text data representations. Image-text matching is used in applications such as content-based image search, visual question answering, and multimodal summarization.
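The retrieval step described above is commonly implemented by embedding images and texts into a shared vector space and ranking gallery items by similarity to the query. Below is a minimal, hypothetical sketch using toy hand-written embeddings and cosine similarity; real systems replace these vectors with outputs of learned vision and language encoders (e.g., the pre-trained models listed below).

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def retrieve(query_emb, gallery):
    """Rank (id, embedding) pairs in the gallery by similarity to the query.

    Works in either direction: a text embedding against an image gallery,
    or an image embedding against a caption gallery.
    """
    return sorted(gallery, key=lambda item: cosine(query_emb, item[1]), reverse=True)

# Toy 3-d embeddings (hypothetical; stand-ins for encoder outputs).
images = [
    ("dog.jpg", [0.9, 0.1, 0.0]),
    ("cat.jpg", [0.1, 0.9, 0.0]),
    ("car.jpg", [0.0, 0.1, 0.9]),
]
text_query = [0.8, 0.2, 0.1]  # pretend embedding of "a photo of a dog"

ranked = retrieve(text_query, images)
print([name for name, _ in ranked])  # → ['dog.jpg', 'cat.jpg', 'car.jpg']
```

The same ranking routine serves both retrieval directions; bridging the heterogeneity gap amounts to training the two encoders so that matching image-text pairs land close together in this shared space.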

Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective

Papers

Showing 176–188 of 188 papers

Title | Status | Hype
ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data | — | 0
Learning fragment self-attention embeddings for image-text matching | Code | 0
UNITER: Learning UNiversal Image-TExt Representations | — | 0
Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators | — | 0
Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training | — | 0
Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking | Code | 0
Knowledge Aware Semantic Concept Expansion for Image-Text Matching | — | 0
Position Focused Attention Network for Image-Text Matching | Code | 0
ParNet: Position-aware Aggregated Relation Network for Image-Text matching | — | 0
Deep Cross-Modal Projection Learning for Image-Text Matching | Code | 0
Cross-modal Subspace Learning for Fine-grained Sketch-based Image Retrieval | — | 0
Learning Two-Branch Neural Networks for Image-Text Matching Tasks | Code | 0
Dual Attention Networks for Multimodal Reasoning and Matching | Code | 0
Page 8 of 8

No leaderboard results yet.