SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text).

Papers

Showing 201-250 of 671 papers

Title | Status | Hype
Knowledge Guided Text Retrieval and Reading for Open Domain Question Answering | Code | 1
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations | Code | 1
Language-agnostic BERT Sentence Embedding | Code | 1
Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift | Code | 1
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval | Code | 1
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner | Code | 1
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data | Code | 1
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language Transformers | Code | 1
ESA: External Space Attention Aggregation for Image-Text Retrieval | Code | 1
CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision | Code | 1
Stacked Cross Attention for Image-Text Matching | Code | 1
SViTT: Temporal Learning of Sparse Video-Text Transformers | Code | 1
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss | Code | 1
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval | Code | 1
ALIP: Adaptive Language-Image Pre-training with Synthetic Caption | Code | 1
CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback | Code | 1
Image-text Retrieval via Preserving Main Semantics of Vision | Code | 1
Towards Fast and Accurate Image-Text Retrieval with Self-Supervised Fine-Grained Alignment | Code | 1
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks | Code | 1
Understanding Differential Search Index for Text Retrieval | Code | 1
Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits | Code | 1
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning | Code | 1
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition | Code | 1
Learning Video Context as Interleaved Multimodal Sequences | Code | 1
MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin | Code | 1
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning | Code | 1
Hyperbolic Image-Text Representations | Code | 1
ComCLIP: Training-Free Compositional Image and Text Matching | Code | 1
FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1
FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions | Code | 1
I0T: Embedding Standardization Method Towards Zero Modality Gap | Code | 1
Condenser: a Pre-training Architecture for Dense Retrieval | Code | 1
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network | Code | 1
Composing Object Relations and Attributes for Image-Text Matching | Code | 1
Learning Relation Alignment for Calibrated Cross-modal Retrieval | Code | 1
Fine-Tuning LLaMA for Multi-Stage Text Retrieval | Code | 1
Consensus-Aware Visual-Semantic Embedding for Image-Text Matching | Code | 1
Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization | Code | 1
Continual learning in cross-modal retrieval | | 0
Free-Form Multi-Modal Multimedia Retrieval (4MR) | | 0
Context-Aware Attention Network for Image-Text Retrieval | | 0
Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks | | 0
Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval | | 0
Attentive Deep Neural Networks for Legal Document Retrieval | | 0
FocalLens: Instruction Tuning Enables Zero-Shot Conditional Image Representations | | 0
Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval | | 0
FLAP: Fast Language-Audio Pre-training | | 0
Constructing Image-Text Pair Dataset from Books | | 0
Align, Adapt and Inject: Sound-guided Unified Image Generation | | 0
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval? | | 0

No leaderboard results yet.