Image-text Retrieval

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 248 papers

Title	Date	Tasks	Status	Hype	Score
A Survey of Medical Vision-and-Language Applications and Their Techniques	Nov 19, 2024	Decision MakingDiagnostic	CodeCode Available	1	5
I0T: Embedding Standardization Method Towards Zero Modality Gap	Dec 18, 2024	Contrastive LearningImage-text Retrieval	CodeCode Available	1	5
ALIP: Adaptive Language-Image Pre-training with Synthetic Caption	Aug 16, 2023	Action ClassificationImage-text Retrieval	CodeCode Available	1	5
Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval	Oct 11, 2019	Graph MatchingImage-text Retrieval	CodeCode Available	1	5
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training	Jun 1, 2022	Contrastive LearningCross-Lingual Transfer	CodeCode Available	1	5
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval	Jun 4, 2021	Graph MatchingImage Retrieval	CodeCode Available	1	5
MLLMs-Augmented Visual-Language Representation Learning	Nov 30, 2023	Image-text RetrievalRepresentation Learning	CodeCode Available	1	5
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation	Jul 1, 2024	Image-text RetrievalQuestion Answering	CodeCode Available	1	5
AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning	Aug 14, 2023	Contrastive LearningGenerative Adversarial Network	CodeCode Available	1	5
FlexiViT: One Model for All Patch Sizes	Dec 15, 2022	AllImage-text Retrieval	CodeCode Available	1	5
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval	Mar 16, 2021	Image-text RetrievalRe-Ranking	CodeCode Available	1	5
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval	Jan 1, 2023	image-classificationImage Classification	CodeCode Available	1	5
Composing Object Relations and Attributes for Image-Text Matching	Jun 17, 2024	AttributeGraph Attention	CodeCode Available	1	5
FILIP: Fine-grained Interactive Language-Image Pre-Training	Nov 9, 2021	image-classificationImage Classification	CodeCode Available	1	5
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping	Apr 26, 2023	DecoderImage Captioning	CodeCode Available	1	5
Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models	Mar 25, 2025	BenchmarkingImage Captioning	CodeCode Available	1	5
ComCLIP: Training-Free Compositional Image and Text Matching	Nov 25, 2022	Image-text matchingImage-text Retrieval	CodeCode Available	1	5
Dynamic Modality Interaction Modeling for Image-Text Retrieval	Jul 11, 2021	cross-modal alignmentCross-Modal Retrieval	CodeCode Available	1	5
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning	Mar 19, 2024	Diagnosticimage-classification	CodeCode Available	1	5
Equivariant Similarity for Vision-Language Foundation Models	Mar 25, 2023	Image-text RetrievalRetrieval	CodeCode Available	1	5
Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift	Dec 15, 2022	BenchmarkingImage Captioning	CodeCode Available	1	5
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval	Oct 27, 2023	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1	5
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training	Jun 15, 2023	Image-text RetrievalRepresentation Learning	CodeCode Available	1	5
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment	Aug 29, 2022	cross-modal alignmentImage-text Retrieval	CodeCode Available	1	5
ESA: External Space Attention Aggregation for Image-Text Retrieval	Oct 10, 2023	Image-text RetrievalRetrieval	CodeCode Available	1	5

Show:10 25 50

← PrevPage 3 of 10Next →

No leaderboard results yet.