SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text).

Papers

Showing 601–650 of 671 papers

| Title | Status | Hype |
| --- | --- | --- |
| SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI |  | 0 |
| Semantic Indexing of Multilingual Corpora and its Application on the History Domain |  | 0 |
| Semantic Search by Latent Ontological Features |  | 0 |
| SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs |  | 0 |
| Show, Recall, and Tell: Image Captioning with Recall Mechanism |  | 0 |
| SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features |  | 0 |
| SILC: Improving Vision Language Pretraining with Self-Distillation |  | 0 |
| SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval |  | 0 |
| SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval |  | 0 |
| SparTerm: Learning Term-based Sparse Representation for Fast Text Retrieval |  | 0 |
| SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities |  | 0 |
| Spotify at TREC 2020: Genre-Aware Abstractive Podcast Summarization |  | 0 |
| Stacked Convolutional Deep Encoding Network for Video-Text Retrieval |  | 0 |
| STAIR: Learning Sparse Text and Image Representation in Grounded Tokens |  | 0 |
| Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval |  | 0 |
| SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval |  | 0 |
| Switch-BERT: Learning to Model Multimodal Interactions by Switching Attention and Input |  | 0 |
| SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment |  | 0 |
| Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding |  | 0 |
| Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding |  | 0 |
| Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval |  | 0 |
| Tailoring Table Retrieval from a Field-aware Hybrid Matching Perspective |  | 0 |
| TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model |  | 0 |
| Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval |  | 0 |
| Memory^3: Language Modeling with Explicit Memory |  | 0 |
| Text Relatedness Based on a Word Thesaurus |  | 0 |
| Text Retrieval by Term Co-occurrences in a Query-based Vector Space |  | 0 |
| Text Retrieval for Language Learners: Graded Vocabulary vs. Open Learner Model |  | 0 |
| The effects of having lists of synonyms on the performance of Afaan Oromo Text Retrieval system |  | 0 |
| The style transformer with common knowledge optimization for image-text retrieval |  | 0 |
| The Text Classification Pipeline: Starting Shallow going Deeper |  | 0 |
| The VISIONE Video Search System: Exploiting Off-the-Shelf Text Search Engines for Large-Scale Video Retrieval |  | 0 |
| TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval |  | 0 |
| TOME: A Two-stage Approach for Model-based Retrieval |  | 0 |
| Towards a Visual-Language Foundation Model for Computational Pathology |  | 0 |
| Towards Robust Ranker for Text Retrieval |  | 0 |
| Towards Understanding Camera Motions in Any Video |  | 0 |
| Transformation of XML Documents with Prolog |  | 0 |
| Transformer Based Language Models for Similar Text Retrieval and Ranking |  | 0 |
| TRAttack: Text Rewriting Attack Against Text Retrieval |  | 0 |
| TREC 2020 Podcasts Track Overview |  | 0 |
| TSVC: Tripartite Learning with Semantic Variation Consistency for Robust Image-Text Retrieval |  | 0 |
| UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training |  | 0 |
| UFO: A UniFied TransfOrmer for Vision-Language Representation Learning |  | 0 |
| Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text Retrieval |  | 0 |
| Unambiguous Text Localization and Retrieval for Cluttered Scenes |  | 0 |
| Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval |  | 0 |
| Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval |  | 0 |
| Dynamic Visual Semantic Sub-Embeddings and Fast Re-Ranking |  | 0 |
| Uncertainty-aware sign language video retrieval with probability distribution modeling |  | 0 |
Page 13 of 14

No leaderboard results yet.