SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text).
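As a rough illustration of the task, the sketch below ranks a handful of passages against a query by counting shared terms. The corpus, the `tokenize` and `retrieve` helpers, and the term-overlap score are all assumptions made for this example; practical systems typically rely on lexical scoring such as BM25 or on learned embeddings rather than raw overlap.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def retrieve(query, passages, top_k=1):
    """Return the top_k passages ranked by a toy term-overlap score.

    The score is the number of occurrences of query terms in a passage.
    This is an illustrative stand-in, not a production ranking function.
    """
    query_terms = set(tokenize(query))
    scored = []
    for index, passage in enumerate(passages):
        counts = Counter(tokenize(passage))
        score = sum(counts[term] for term in query_terms)
        scored.append((score, index))
    # Highest score first; ties keep the original passage order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [passages[index] for _, index in scored[:top_k]]


# Hypothetical corpus, for illustration only.
passages = [
    "BM25 is a lexical ranking function based on term frequency.",
    "Dense retrievers embed queries and passages into a shared vector space.",
    "Image captioning generates text that describes a picture.",
]

best = retrieve("how do dense retrievers embed passages", passages)
print(best[0])
```

The query can be a question, a keyword list, or any other text; only the scoring function would need to change to move from this toy overlap count to a stronger ranker.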

Papers

Showing 1-25 of 671 papers

Title | Status | Hype
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models | Code | 7
h2oGPT: Democratizing Large Language Models | Code | 6
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval | Code | 5
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation | Code | 5
BM25S: Orders of magnitude faster lexical search via eager sparse scoring | Code | 5
MTEB: Massive Text Embedding Benchmark | Code | 4
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers | Code | 4
FG-CLIP: Fine-Grained Visual and Textual Alignment | Code | 4
Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers | Code | 4
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment | Code | 4
RETSim: Resilient and Efficient Text Similarity | Code | 4
Multi-label Cluster Discrimination for Visual Representation Learning | Code | 4
Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding | Code | 3
Vision-Language Pre-training: Basics, Recent Advances, and Future Trends | Code | 3
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities | Code | 3
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models | Code | 3
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation | Code | 3
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models | Code | 3
Egocentric Video-Language Pretraining | Code | 2
Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment | Code | 2
DreamLIP: Language-Image Pre-training with Long Captions | Code | 2
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models | Code | 2
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models | Code | 2
Distillation Enhanced Generative Retrieval | Code | 2
Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse Representations | Code | 2

No leaderboard results yet.