SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any related text).
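As a rough illustration of the task, the sketch below ranks a handful of passages against a query using TF-IDF weighted cosine similarity. It is a minimal toy baseline, not the method of any paper listed on this page; the helper names `tokenize` and `retrieve`, the smoothed IDF formula, and the sample passages are all assumptions made for the example.

```python
import math
from collections import Counter

PUNCT = ".,;:!?()\"'"


def tokenize(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (tok.strip(PUNCT).lower() for tok in text.split())
    return [tok for tok in tokens if tok]


def retrieve(query, passages):
    """Return (index of best passage, list of scores) for a query.

    Scores are cosine similarities between TF-IDF vectors. This is an
    illustrative sparse baseline, assumed here for the example only.
    """
    docs = [Counter(tokenize(p)) for p in passages]
    n = len(docs)

    # Document frequency: iterating a Counter yields each distinct term once.
    df = Counter(term for doc in docs for term in doc)
    # Smoothed inverse document frequency (one common variant).
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}

    def vectorize(counts):
        # Terms never seen in the passages are dropped from the vector.
        return {t: tf * idf[t] for t, tf in counts.items() if t in idf}

    def cosine(a, b):
        dot = sum(w * b.get(t, 0.0) for t, w in a.items())
        norm_a = math.sqrt(sum(w * w for w in a.values()))
        norm_b = math.sqrt(sum(w * w for w in b.values()))
        return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

    q_vec = vectorize(Counter(tokenize(query)))
    scores = [cosine(q_vec, vectorize(d)) for d in docs]
    best = max(range(n), key=scores.__getitem__)
    return best, scores


passages = [
    "The Eiffel Tower is located in Paris.",
    "Dense retrievers encode queries and passages into vectors.",
    "Sparse retrieval methods match query terms against an inverted index.",
]
best, scores = retrieve("Where is the Eiffel Tower?", passages)
print(passages[best])
```

The query shares terms only with the first passage, so that passage receives the only non-zero score. Dense retrievers replace the term-count vectors with learned embeddings but keep the same rank-by-similarity structure.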

Papers

Showing 601-625 of 671 papers

Title | Status | Hype
SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI | | 0
Semantic Indexing of Multilingual Corpora and its Application on the History Domain | | 0
Semantic Search by Latent Ontological Features | | 0
SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs | | 0
Show, Recall, and Tell: Image Captioning with Recall Mechanism | | 0
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features | | 0
SILC: Improving Vision Language Pretraining with Self-Distillation | | 0
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval | | 0
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval | | 0
SparTerm: Learning Term-based Sparse Representation for Fast Text Retrieval | | 0
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities | | 0
Spotify at TREC 2020: Genre-Aware Abstractive Podcast Summarization | | 0
Stacked Convolutional Deep Encoding Network for Video-Text Retrieval | | 0
STAIR: Learning Sparse Text and Image Representation in Grounded Tokens | | 0
Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval | | 0
SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval | | 0
Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input | | 0
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment | | 0
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding | | 0
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding | | 0
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval | | 0
Tailoring Table Retrieval from a Field-aware Hybrid Matching Perspective | | 0
TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model | | 0
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval | | 0
Memory^3: Language Modeling with Explicit Memory | | 0

No leaderboard results yet.