Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 671 papers

Title	Date	Tasks	Status	Hype	Score
mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs	May 16, 2025	Information RetrievalKnowledge Graphs	CodeCode Available	1	5
Understanding Differential Search Index for Text Retrieval	May 3, 2023	Information RetrievalRetrieval	CodeCode Available	1	5
Rethinking Benchmarks for Cross-modal Image-text Retrieval	Apr 21, 2023	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1	5
Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift	Dec 15, 2022	BenchmarkingImage Captioning	CodeCode Available	1	5
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval	Apr 18, 2021	RetrievalText Retrieval	CodeCode Available	1	5
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner	May 19, 2023	Dense CaptioningImage Captioning	CodeCode Available	1	5
Cross-modal Contrastive Learning for Speech Translation	May 5, 2022	Contrastive LearningRetrieval	CodeCode Available	1	5
Global and Local Semantic Completion Learning for Vision-Language Pre-training	Jun 12, 2023	cross-modal alignmentImage-text Retrieval	CodeCode Available	1	5
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition	Jan 1, 2021	Image-text RetrievalMedical Image Analysis	CodeCode Available	1	5
CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision	Dec 14, 2021	Contrastive LearningRepresentation Learning	CodeCode Available	1	5
Multimodal Federated Learning via Contrastive Representation Ensemble	Feb 17, 2023	Federated LearningImage-text Retrieval	CodeCode Available	1	5
Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control	Feb 27, 2024	GPUImage Retrieval	CodeCode Available	1	5
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations	Jun 14, 2023	image-classificationImage Classification	CodeCode Available	1	5
GOAL: Global-local Object Alignment Learning	Mar 22, 2025	DescriptiveObject	CodeCode Available	1	5
Rethink Training of BERT Rerankers in Multi-Stage Retrieval Pipeline	Jan 21, 2021	RetrievalText Retrieval	CodeCode Available	1	5
Nearest Neighbor Normalization Improves Multimodal Retrieval	Oct 31, 2024	Cross-Modal RetrievalImage Captioning	CodeCode Available	1	5
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers	May 27, 2023	Image CaptioningImage Retrieval	CodeCode Available	1	5
Video-Text Pre-training with Learned Regions	Dec 2, 2021	Representation LearningRetrieval	CodeCode Available	1	5
Generative Multi-hop Retrieval	Apr 27, 2022	DecoderGPU	CodeCode Available	1	5
ALIP: Adaptive Language-Image Pre-training with Synthetic Caption	Aug 16, 2023	Action ClassificationImage-text Retrieval	CodeCode Available	1	5
Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits	Feb 12, 2021	CPUDocument Ranking	CodeCode Available	1	5
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning	Oct 27, 2022	Language ModelingLanguage Modelling	CodeCode Available	1	5
CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback	Jun 19, 2021	Image RetrievalImage-text Retrieval	CodeCode Available	1	5
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search	Dec 30, 2024	RAGRetrieval	CodeCode Available	1	5
On Metric Learning for Audio-Text Cross-Modal Retrieval	Mar 29, 2022	AudioCapsCross-Modal Retrieval	CodeCode Available	1	5
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning	Mar 19, 2024	Diagnosticimage-classification	CodeCode Available	1	5
Fast and Light-Weight Answer Text Retrieval in Dialogue Systems	May 27, 2022	Re-RankingRetrieval	CodeCode Available	1	5
ComCLIP: Training-Free Compositional Image and Text Matching	Nov 25, 2022	Image-text matchingImage-text Retrieval	CodeCode Available	1	5
FETA: Towards Specializing Foundation Models for Expert Task Applications	Sep 8, 2022	Domain GeneralizationFew-Shot Learning	CodeCode Available	1	5
MLLMs-Augmented Visual-Language Representation Learning	Nov 30, 2023	Image-text RetrievalRepresentation Learning	CodeCode Available	1	5
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions	May 28, 2023	AttributeImage Captioning	CodeCode Available	1	5
Learning Video Context as Interleaved Multimodal Sequences	Jul 31, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network	Jan 1, 2023	Image-text matchingRetrieval	CodeCode Available	1	5
Composing Object Relations and Attributes for Image-Text Matching	Jun 17, 2024	AttributeGraph Attention	CodeCode Available	1	5
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model	Aug 15, 2023	DecoderObject	CodeCode Available	1	5
GLEN: Generative Retrieval via Lexical Index Learning	Nov 6, 2023	Learning-To-RankRetrieval	CodeCode Available	1	5
Consensus-Aware Visual-Semantic Embedding for Image-Text Matching	Jul 17, 2020	Image CaptioningImage-text matching	CodeCode Available	1	5
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning	Apr 7, 2021	Representation LearningRetrieval	CodeCode Available	1	5
From Unimodal to Multimodal: Scaling up Projectors to Align Modalities	Sep 28, 2024	Image-text RetrievalSemantic Similarity	CodeCode Available	0	5
ProCIS: A Benchmark for Proactive Retrieval in Conversations	May 10, 2024	RetrievalText Retrieval	CodeCode Available	0	5
Attacking Attention of Foundation Models Disrupts Downstream Tasks	Jun 3, 2025	Depth EstimationImage-text Retrieval	CodeCode Available	0	5
Pre-trained Language Models Can be Fully Zero-Shot Learners	Dec 14, 2022	Retrievaltext-classification	CodeCode Available	0	5
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors	Feb 20, 2025	AudioCapsContrastive Learning	CodeCode Available	0	5
Partial Scene Text Retrieval	Nov 15, 2024	Multiple Instance LearningRetrieval	CodeCode Available	0	5
Invisible Relevance Bias: Text-Image Retrieval Models Prefer AI-Generated Images	Nov 23, 2023	Cross-Modal RetrievalImage Retrieval	CodeCode Available	0	5
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysis	Jul 29, 2024	Image-text RetrievalModel Selection	CodeCode Available	0	5
OTE: Exploring Accurate Scene Text Recognition Using One Token	Jan 1, 2024	DecoderScene Text Recognition	CodeCode Available	0	5
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts	May 24, 2023	Dialogue State TrackingImage Retrieval	CodeCode Available	0	5
A Hybrid Retrieval-Generation Neural Conversation Model	Apr 19, 2019	Diversitymodel	CodeCode Available	0	5
Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval	Apr 6, 2023	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	0	5

Show:10 25 50

← PrevPage 5 of 14Next →

No leaderboard results yet.