Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–150 of 671 papers

Title	Date	Tasks	Status	Hype	Score
CLIP2Video: Mastering Video-Text Retrieval via Image CLIP	Jun 21, 2021	Language ModelingLanguage Modelling	CodeCode Available	1	5
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval	Apr 18, 2021	RetrievalText Retrieval	CodeCode Available	1	5
A Comprehensive Review of the Video-to-Text Problem	Mar 27, 2021	Question AnsweringRetrieval	CodeCode Available	1	5
Extending Multi-modal Contrastive Representations	Oct 13, 2023	3D Object ClassificationRepresentation Learning	CodeCode Available	1	5
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning	Mar 19, 2024	Diagnosticimage-classification	CodeCode Available	1	5
Bridging Language Gaps in Audio-Text Retrieval	Jun 11, 2024	AudioCapsRetrieval	CodeCode Available	1	5
Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits	Feb 12, 2021	CPUDocument Ranking	CodeCode Available	1	5
Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration	May 26, 2024	Information RetrievalRetrieval	CodeCode Available	1	5
LinkTransformer: A Unified Package for Record Linkage with Transformer Language Models	Sep 2, 2023	BlockingLanguage Modelling	CodeCode Available	1	5
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning	Oct 27, 2022	Language ModelingLanguage Modelling	CodeCode Available	1	5
Bridging Video-text Retrieval with Multiple Choice Questions	Jan 13, 2022	Action RecognitionLinear evaluation	CodeCode Available	1	5
Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder	Nov 1, 2021	DecoderLanguage Modeling	CodeCode Available	1	5
ComCLIP: Training-Free Compositional Image and Text Matching	Nov 25, 2022	Image-text matchingImage-text Retrieval	CodeCode Available	1	5
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark	Aug 5, 2024	Dense Video CaptioningDiversity	CodeCode Available	1	5
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval	Feb 6, 2023	Image-text RetrievalRetrieval	CodeCode Available	1	5
Composing Object Relations and Attributes for Image-Text Matching	Jun 17, 2024	AttributeGraph Attention	CodeCode Available	1	5
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition	Jan 1, 2021	Image-text RetrievalMedical Image Analysis	CodeCode Available	1	5
Consensus-Aware Visual-Semantic Embedding for Image-Text Matching	Jul 17, 2020	Image CaptioningImage-text matching	CodeCode Available	1	5
ESA: External Space Attention Aggregation for Image-Text Retrieval	Oct 10, 2023	Image-text RetrievalRetrieval	CodeCode Available	1	5
Graph Optimal Transport for Cross-Domain Alignment	Jun 26, 2020	Graph MatchingImage Captioning	CodeCode Available	1	5
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval	Jun 4, 2021	Graph MatchingImage Retrieval	CodeCode Available	1	5
Learning to Rank in Generative Retrieval	Jun 27, 2023	Learning-To-RankPassage Ranking	CodeCode Available	1	5
Nonparametric Decoding for Generative Retrieval	Oct 5, 2022	DecoderLanguage Modelling	CodeCode Available	1	5
Audio Retrieval with Natural Language Queries: A Benchmark Study	Dec 17, 2021	AudioCapsAudio captioning	CodeCode Available	1	5
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding	Jun 15, 2023	Contrastive Learningimage-classification	CodeCode Available	1	5
Contrastive Audio-Language Learning for Music	Aug 25, 2022	Audio to Text RetrievalDescriptive	CodeCode Available	1	5
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval	Jan 1, 2023	image-classificationImage Classification	CodeCode Available	1	5
Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory	Mar 19, 2024	Adversarial TextDiversity	CodeCode Available	1	5
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner	May 19, 2023	Dense CaptioningImage Captioning	CodeCode Available	1	5
Learning Semantic Relationship Among Instances for Image-Text Matching	Jan 1, 2023	Cross-Modal RetrievalImage Retrieval	CodeCode Available	1	5
Equivariant Similarity for Vision-Language Foundation Models	Mar 25, 2023	Image-text RetrievalRetrieval	CodeCode Available	1	5
Learning the Best Pooling Strategy for Visual Semantic Embedding	Nov 9, 2020	Cross-Modal Information RetrievalImage-text Retrieval	CodeCode Available	1	5
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval	Mar 16, 2021	Image-text RetrievalRe-Ranking	CodeCode Available	1	5
LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration	Feb 18, 2024	Multi-hop Question AnsweringQuestion Answering	CodeCode Available	1	5
Large-Scale Adversarial Training for Vision-and-Language Representation Learning	Jun 11, 2020	Image-text RetrievalQuestion Answering	CodeCode Available	1	5
LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval	Mar 11, 2022	Contrastive LearningRe-Ranking	CodeCode Available	1	5
LDMol: Text-to-Molecule Diffusion Model with Structurally Informative Latent Space	May 28, 2024	Contrastive LearningDecoder	CodeCode Available	1	5
A Data-Centric Framework for Composable NLP Workflows	Mar 2, 2021	RetrievalText Retrieval	CodeCode Available	1	5
Learnable Pillar-based Re-ranking for Image-Text Retrieval	Apr 25, 2023	Image-text RetrievalRe-Ranking	CodeCode Available	1	5
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training	Jun 15, 2023	Image-text RetrievalRepresentation Learning	CodeCode Available	1	5
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models	Jun 10, 2025	Contrastive LearningImage-text matching	CodeCode Available	1	5
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment	Aug 29, 2022	cross-modal alignmentImage-text Retrieval	CodeCode Available	1	5
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training	Jun 1, 2022	Contrastive LearningCross-Lingual Transfer	CodeCode Available	1	5
Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling	Apr 14, 2021	GPURe-Ranking	CodeCode Available	1	5
Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval	Oct 11, 2019	Graph MatchingImage-text Retrieval	CodeCode Available	1	5
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data	Apr 7, 2018	RetrievalText Retrieval	CodeCode Available	1	5
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation	Jul 1, 2024	Image-text RetrievalQuestion Answering	CodeCode Available	1	5
Cross-Modal Retrieval with Partially Mismatched Pairs	Feb 22, 2023	Contrastive LearningCross-Modal Retrieval	CodeCode Available	1	5
Cross-Modal Retrieval for Motion and Text via DopTriple Loss	May 7, 2023	Cross-Modal RetrievalRetrieval	CodeCode Available	1	5
DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval	Jun 10, 2025	Image CaptioningRetrieval	CodeCode Available	1	5

Show:10 25 50

← PrevPage 3 of 14Next →

No leaderboard results yet.