SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) for a given query (which could be a question, keywords, or any other relevant text).

Papers

Showing 101–125 of 671 papers

Title | Status | Hype
CLIP2Video: Mastering Video-Text Retrieval via Image CLIP | Code | 1
CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval | Code | 1
FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1
Bridging Language Gaps in Audio-Text Retrieval | Code | 1
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search | Code | 1
Eye-gaze Guided Multi-modal Alignment for Medical Representation Learning | Code | 1
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone | Code | 1
Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration | Code | 1
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning | Code | 1
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning | Code | 1
A Survey of Medical Vision-and-Language Applications and Their Techniques | Code | 1
Bridging Video-text Retrieval with Multiple Choice Questions | Code | 1
Equivariant Similarity for Vision-Language Foundation Models | Code | 1
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark | Code | 1
ESA: External Space Attention Aggregation for Image-Text Retrieval | Code | 1
Composing Object Relations and Attributes for Image-Text Matching | Code | 1
Helping Hands: An Object-Aware Ego-Centric Video Recognition Model | Code | 1
Consensus-Aware Visual-Semantic Embedding for Image-Text Matching | Code | 1
A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports | Code | 1
A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval | Code | 1
Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss | Code | 1
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval | Code | 1
Nonparametric Decoding for Generative Retrieval | Code | 1
Audio Retrieval with Natural Language Queries: A Benchmark Study | Code | 1
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner | Code | 1

No leaderboard results yet.