Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 671 papers

Title	Date	Tasks	Status	Hype
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation	Jul 1, 2024	Image-text RetrievalQuestion Answering	CodeCode Available	1
DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases	Sep 30, 2022	Entity LinkingQuestion Answering	CodeCode Available	1
GOAL: Global-local Object Alignment Learning	Mar 22, 2025	DescriptiveObject	CodeCode Available	1
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval	Oct 9, 2024	RetrievalText Retrieval	CodeCode Available	1
HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking	May 21, 2022	Passage RankingPassage Re-Ranking	CodeCode Available	1
Hyperbolic Image-Text Representations	Apr 18, 2023	image-classificationImage Classification	CodeCode Available	1
Image-text Retrieval via Preserving Main Semantics of Vision	Apr 20, 2023	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss	Sep 9, 2021	Mixture-of-ExpertsRetrieval	CodeCode Available	1
Equivariant Similarity for Vision-Language Foundation Models	Mar 25, 2023	Image-text RetrievalRetrieval	CodeCode Available	1
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training	Jun 1, 2022	Contrastive LearningCross-Lingual Transfer	CodeCode Available	1
Global and Local Semantic Completion Learning for Vision-Language Pre-training	Jun 12, 2023	cross-modal alignmentImage-text Retrieval	CodeCode Available	1
Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval	Oct 11, 2019	Graph MatchingImage-text Retrieval	CodeCode Available	1
Densifying Sparse Representations for Passage Retrieval by Representational Slicing	Dec 9, 2021	Passage RetrievalRetrieval	CodeCode Available	1
Large-Scale Adversarial Training for Vision-and-Language Representation Learning	Jun 11, 2020	Image-text RetrievalQuestion Answering	CodeCode Available	1
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset	Dec 8, 2022	DiversityImage Description	CodeCode Available	1
Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory	Mar 19, 2024	Adversarial TextDiversity	CodeCode Available	1
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition	Jan 1, 2021	Image-text RetrievalMedical Image Analysis	CodeCode Available	1
Learning Semantic Relationship Among Instances for Image-Text Matching	Jan 1, 2023	Cross-Modal RetrievalImage Retrieval	CodeCode Available	1
Learning to Rank in Generative Retrieval	Jun 27, 2023	Learning-To-RankPassage Ranking	CodeCode Available	1
DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval	Jun 10, 2025	Image CaptioningRetrieval	CodeCode Available	1
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval	Jan 1, 2023	image-classificationImage Classification	CodeCode Available	1
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval	Mar 16, 2021	Image-text RetrievalRe-Ranking	CodeCode Available	1
Bridging Video-text Retrieval with Multiple Choice Questions	Jan 13, 2022	Action RecognitionLinear evaluation	CodeCode Available	1
HANet: Hierarchical Alignment Networks for Video-Text Retrieval	Jul 26, 2021	RetrievalText Matching	CodeCode Available	1
Kaleido-BERT: Vision-Language Pre-training on Fashion Domain	Mar 30, 2021	Image RetrievalRetrieval	CodeCode Available	1
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model	Oct 11, 2022	Contrastive LearningImage-text matching	CodeCode Available	1
Cross-Modal Retrieval with Partially Mismatched Pairs	Feb 22, 2023	Contrastive LearningCross-Modal Retrieval	CodeCode Available	1
Cross-Modal Retrieval for Motion and Text via DopTriple Loss	May 7, 2023	Cross-Modal RetrievalRetrieval	CodeCode Available	1
Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models	Mar 25, 2025	BenchmarkingImage Captioning	CodeCode Available	1
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search	Dec 30, 2024	RAGRetrieval	CodeCode Available	1
A Comprehensive Review of the Video-to-Text Problem	Mar 27, 2021	Question AnsweringRetrieval	CodeCode Available	1
Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQA	Oct 11, 2022	Open-Domain Question AnsweringQuestion Answering	CodeCode Available	1
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval	Apr 1, 2021	RetrievalText Retrieval	CodeCode Available	1
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions	May 28, 2023	AttributeImage Captioning	CodeCode Available	1
Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data	Oct 8, 2023	Action RecognitionContinual Learning	CodeCode Available	1
More Robust Dense Retrieval with Contrastive Dual Learning	Jul 16, 2021	Contrastive LearningInformation Retrieval	CodeCode Available	1
A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval	Oct 27, 2023	Cross-Modal RetrievalImage-text Retrieval	CodeCode Available	1
Generative Multi-hop Retrieval	Apr 27, 2022	DecoderGPU	CodeCode Available	1
Cross-modal Contrastive Learning for Speech Translation	May 5, 2022	Contrastive LearningRetrieval	CodeCode Available	1
Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling	Apr 14, 2021	GPURe-Ranking	CodeCode Available	1
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping	Apr 26, 2023	DecoderImage Captioning	CodeCode Available	1
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations	Jun 14, 2023	image-classificationImage Classification	CodeCode Available	1
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers	May 27, 2023	Image CaptioningImage Retrieval	CodeCode Available	1
Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training	Jun 15, 2023	Image-text RetrievalRepresentation Learning	CodeCode Available	1
Fine-Tuning LLaMA for Multi-Stage Text Retrieval	Oct 12, 2023	Passage RetrievalRetrieval	CodeCode Available	1
ALIP: Adaptive Language-Image Pre-training with Synthetic Caption	Aug 16, 2023	Action ClassificationImage-text Retrieval	CodeCode Available	1
Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New Benchmark	Jun 10, 2023	Image-text RetrievalMedical Report Generation	CodeCode Available	1
MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval	Jan 19, 2023	RetrievalText Retrieval	CodeCode Available	1
CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback	Jun 19, 2021	Image RetrievalImage-text Retrieval	CodeCode Available	1
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning	Mar 1, 2020	Cross-Modal RetrievalRetrieval	CodeCode Available	1

Show:10 25 50

← PrevPage 4 of 14Next →

No leaderboard results yet.