SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 201225 of 671 papers

TitleStatusHype
Fine-grained Video-Text Retrieval with Hierarchical Graph ReasoningCode1
Fine-Tuning LLaMA for Multi-Stage Text RetrievalCode1
CLIP2Video: Mastering Video-Text Retrieval via Image CLIPCode1
Benchmarking Robustness of Multimodal Image-Text Models under Distribution ShiftCode1
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip RetrievalCode1
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense CaptionerCode1
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language RepresentationsCode1
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language TransformersCode1
ESA: External Space Attention Aggregation for Image-Text RetrievalCode1
CLIP-Lite: Information Efficient Visual Representation Learning with Language SupervisionCode1
Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQACode1
MixGen: A New Multi-Modal Data AugmentationCode1
FILIP: Fine-grained Interactive Language-Image Pre-TrainingCode1
MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal AdapterCode1
Helping Hands: An Object-Aware Ego-Centric Video Recognition ModelCode1
ALIP: Adaptive Language-Image Pre-training with Synthetic CaptionCode1
Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents IntegrationCode1
Mr. Right: Multimodal Retrieval on Representation of ImaGe witH TextCode1
CoSMo: Content-Style Modulation for Image Retrieval With Text FeedbackCode1
LinkTransformer: A Unified Package for Record Linkage with Transformer Language ModelsCode1
Fast and Light-Weight Answer Text Retrieval in Dialogue SystemsCode1
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust LearningCode1
Learning Video Context as Interleaved Multimodal SequencesCode1
Extending Multi-modal Contrastive RepresentationsCode1
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning NetworkCode1
Show:102550
← PrevPage 9 of 27Next →

No leaderboard results yet.