Image-text Retrieval

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 248 papers

Title	Date	Tasks	Status	Hype
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model	Oct 11, 2022	Contrastive LearningImage-text matching	CodeCode Available	1
Learnable Pillar-based Re-ranking for Image-Text Retrieval	Apr 25, 2023	Image-text RetrievalRe-Ranking	CodeCode Available	1
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval	Oct 27, 2023	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1
Learning Relation Alignment for Calibrated Cross-modal Retrieval	May 28, 2021	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1
ESA: External Space Attention Aggregation for Image-Text Retrieval	Oct 10, 2023	Image-text RetrievalRetrieval	CodeCode Available	1
Graph Optimal Transport for Cross-Domain Alignment	Jun 26, 2020	Graph MatchingImage Captioning	CodeCode Available	1
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections	May 24, 2022	Computational Efficiencycross-modal alignment	CodeCode Available	1
Multimodal Federated Learning via Contrastive Representation Ensemble	Feb 17, 2023	Federated LearningImage-text Retrieval	CodeCode Available	1
Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New Benchmark	Jun 10, 2023	Image-text RetrievalMedical Report Generation	CodeCode Available	1
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval	Feb 6, 2023	Image-text RetrievalRetrieval	CodeCode Available	1
FETA: Towards Specializing Foundation Models for Expert Task Applications	Sep 8, 2022	Domain GeneralizationFew-Shot Learning	CodeCode Available	1
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone	Jun 15, 2022	Described Object DetectionImage Captioning	CodeCode Available	1
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval	Mar 8, 2020	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1
FILIP: Fine-grained Interactive Language-Image Pre-Training	Nov 9, 2021	image-classificationImage Classification	CodeCode Available	1
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding	Jun 15, 2023	Contrastive Learningimage-classification	CodeCode Available	1
I0T: Embedding Standardization Method Towards Zero Modality Gap	Dec 18, 2024	Contrastive LearningImage-text Retrieval	CodeCode Available	1
FlexiViT: One Model for All Patch Sizes	Dec 15, 2022	AllImage-text Retrieval	CodeCode Available	1
CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback	Jun 19, 2021	Image RetrievalImage-text Retrieval	CodeCode Available	1
A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports	Sep 3, 2020	Image-text RetrievalMedical Visual Question Answering	CodeCode Available	1
Image-text Retrieval via Preserving Main Semantics of Vision	Apr 20, 2023	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1
Large-Scale Adversarial Training for Vision-and-Language Representation Learning	Jun 11, 2020	Image-text RetrievalQuestion Answering	CodeCode Available	1
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval	Jan 1, 2023	image-classificationImage Classification	CodeCode Available	1
MLLMs-Augmented Visual-Language Representation Learning	Nov 30, 2023	Image-text RetrievalRepresentation Learning	CodeCode Available	1
VladVA: Discriminative Fine-tuning of LVLMs	Dec 5, 2024	Image-text RetrievalRepresentation Learning	—Unverified	0
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval	Oct 12, 2023	Cross-Modal RetrievalImage-text Retrieval	—Unverified	0

Show:10 25 50

← PrevPage 4 of 10Next →

No leaderboard results yet.