Image-text Retrieval

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 248 papers

Title	Date	Tasks	Status	Hype	Score
Learnable Pillar-based Re-ranking for Image-Text Retrieval	Apr 25, 2023	Image-text RetrievalRe-Ranking	CodeCode Available	1	5
Learning Relation Alignment for Calibrated Cross-modal Retrieval	May 28, 2021	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1	5
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner	May 19, 2023	Dense CaptioningImage Captioning	CodeCode Available	1	5
Equivariant Similarity for Vision-Language Foundation Models	Mar 25, 2023	Image-text RetrievalRetrieval	CodeCode Available	1	5
ESA: External Space Attention Aggregation for Image-Text Retrieval	Oct 10, 2023	Image-text RetrievalRetrieval	CodeCode Available	1	5
Learning the Best Pooling Strategy for Visual Semantic Embedding	Nov 9, 2020	Cross-Modal Information RetrievalImage-text Retrieval	CodeCode Available	1	5
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval	Feb 6, 2023	Image-text RetrievalRetrieval	CodeCode Available	1	5
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone	Jun 15, 2022	Described Object DetectionImage Captioning	CodeCode Available	1	5
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition	Jan 1, 2021	Image-text RetrievalMedical Image Analysis	CodeCode Available	1	5
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning	Mar 19, 2024	Diagnosticimage-classification	CodeCode Available	1	5
FETA: Towards Specializing Foundation Models for Expert Task Applications	Sep 8, 2022	Domain GeneralizationFew-Shot Learning	CodeCode Available	1	5
Global and Local Semantic Completion Learning for Vision-Language Pre-training	Jun 12, 2023	cross-modal alignmentImage-text Retrieval	CodeCode Available	1	5
A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports	Sep 3, 2020	Image-text RetrievalMedical Visual Question Answering	CodeCode Available	1	5
Graph Optimal Transport for Cross-Domain Alignment	Jun 26, 2020	Graph MatchingImage Captioning	CodeCode Available	1	5
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding	Jun 15, 2023	Contrastive Learningimage-classification	CodeCode Available	1	5
Image-text Retrieval via Preserving Main Semantics of Vision	Apr 20, 2023	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1	5
FlexiViT: One Model for All Patch Sizes	Dec 15, 2022	AllImage-text Retrieval	CodeCode Available	1	5
CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback	Jun 19, 2021	Image RetrievalImage-text Retrieval	CodeCode Available	1	5
Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models	Mar 25, 2025	BenchmarkingImage Captioning	CodeCode Available	1	5
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers	May 11, 2023	Contrastive LearningImage-text Retrieval	CodeCode Available	1	5
Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models	Jul 26, 2023	Image-text RetrievalRetrieval	CodeCode Available	1	5
Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New Benchmark	Jun 10, 2023	Image-text RetrievalMedical Report Generation	CodeCode Available	1	5
UGNCL: Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching	Jul 11, 2024	Cross-Modal RetrievalCross-modal retrieval with noisy correspondence	CodeCode Available	1	5
An Unsupervised Cross-Modal Hashing Method Robust to Noisy Training Image-Text Correspondences in Remote Sensing	Feb 26, 2022	Image-text RetrievalMeta-Learning	CodeCode Available	0	5
Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis	Feb 11, 2023	Image-text RetrievalKnowledge Graphs	CodeCode Available	0	5

Show:10 25 50

← PrevPage 4 of 10Next →

No leaderboard results yet.