SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 151175 of 671 papers

TitleStatusHype
ESA: External Space Attention Aggregation for Image-Text RetrievalCode1
DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge BasesCode1
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-trainingCode1
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video RetrievalCode1
Fine-grained Video-Text Retrieval with Hierarchical Graph ReasoningCode1
FETA: Towards Specializing Foundation Models for Expert Task ApplicationsCode1
Cross-modal Scene Graph Matching for Relationship-aware Image-Text RetrievalCode1
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning NetworkCode1
Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence RegularizationCode1
Cross-Modal Retrieval with Partially Mismatched PairsCode1
Cross-Modal Retrieval for Motion and Text via DopTriple LossCode1
LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text RetrievalCode1
Densifying Sparse Representations for Passage Retrieval by Representational SlicingCode1
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text RetrievalCode1
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue DatasetCode1
Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial TrajectoryCode1
A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalCode1
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal MappingCode1
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text RetrievalCode1
DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text RetrievalCode1
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense CaptionerCode1
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse RetrievalCode1
Bridging Video-text Retrieval with Multiple Choice QuestionsCode1
LinkTransformer: A Unified Package for Record Linkage with Transformer Language ModelsCode1
Cross-modal Contrastive Learning for Speech TranslationCode1
Show:102550
← PrevPage 7 of 27Next →

No leaderboard results yet.