SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 201250 of 671 papers

TitleStatusHype
mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge GraphsCode1
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connectionsCode1
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image RecognitionCode1
Benchmarking Robustness of Multimodal Image-Text Models under Distribution ShiftCode1
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip RetrievalCode1
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense CaptionerCode1
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language RepresentationsCode1
Equivariant Similarity for Vision-Language Foundation ModelsCode1
ESA: External Space Attention Aggregation for Image-Text RetrievalCode1
CLIP-Lite: Information Efficient Visual Representation Learning with Language SupervisionCode1
Nearest Neighbor Normalization Improves Multimodal RetrievalCode1
GOAL: Global-local Object Alignment LearningCode1
ReCLAP: Improving Zero Shot Audio Classification by Describing SoundsCode1
Vision-Language Dataset DistillationCode1
Coarse-to-Fine Vision-Language Pre-training with Fusion in the BackboneCode1
CrossGET: Cross-Guided Ensemble of Tokens for Accelerating Vision-Language TransformersCode1
Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents IntegrationCode1
Multi-modal Pre-training for Medical Vision-language Understanding and Generation: An Empirical Study with A New BenchmarkCode1
Generative Multi-hop RetrievalCode1
ALIP: Adaptive Language-Image Pre-training with Synthetic CaptionCode1
CoSMo: Content-Style Modulation for Image Retrieval With Text FeedbackCode1
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust LearningCode1
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based SearchCode1
ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain RetrievalCode1
FuseCap: Leveraging Large Language Models for Enriched Fused Image CaptionsCode1
Learning Video Context as Interleaved Multimodal SequencesCode1
Language-agnostic BERT Sentence EmbeddingCode1
ComCLIP: Training-Free Compositional Image and Text MatchingCode1
FETA: Towards Specializing Foundation Models for Expert Task ApplicationsCode1
GLEN: Generative Retrieval via Lexical Index LearningCode1
Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQACode1
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal RetrievalCode1
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning NetworkCode1
Composing Object Relations and Attributes for Image-Text MatchingCode1
Fine-grained Video-Text Retrieval with Hierarchical Graph ReasoningCode1
Fine-Tuning LLaMA for Multi-Stage Text RetrievalCode1
Consensus-Aware Visual-Semantic Embedding for Image-Text MatchingCode1
Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision TransformersCode1
From Unimodal to Multimodal: Scaling up Projectors to Align ModalitiesCode0
Attacking Attention of Foundation Models Disrupts Downstream TasksCode0
PEFA: Parameter-Free Adapters for Large-scale Embedding-based Retrieval ModelsCode0
Partial Scene Text RetrievalCode0
Pre-trained Language Models Can be Fully Zero-Shot LearnersCode0
On Using GUI Interaction Data to Improve Text Retrieval-based Bug LocalizationCode0
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution ErrorsCode0
OTE: Exploring Accurate Scene Text Recognition Using One TokenCode0
Invisible Relevance Bias: Text-Image Retrieval Models Prefer AI-Generated ImagesCode0
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysisCode0
PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional ExpertsCode0
Object-Aware Query Perturbation for Cross-Modal Image-Text RetrievalCode0
Show:102550
← PrevPage 5 of 14Next →

No leaderboard results yet.