SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) for a given query (which could be a question, keywords, or any relevant text).
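As a rough illustration of the task definition, the sketch below ranks a few passages against a query using bag-of-words cosine similarity. It is only a toy baseline and is not taken from any paper listed here; the function names (`tokenize`, `cosine`, `retrieve`) and the sample passages are hypothetical and chosen for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep alphanumeric word tokens only.
    return re.findall(r"\w+", text.lower())


def cosine(a, b):
    # Cosine similarity between two term-count vectors (Counters).
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, passages, top_k=1):
    # Score every passage against the query and return the top_k
    # (score, passage) pairs, highest score first.
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(p))), p) for p in passages]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Hypothetical sample data.
passages = [
    "Paris is the capital of France.",
    "The mitochondria is the powerhouse of the cell.",
    "Python is a popular programming language.",
]

best_score, best_passage = retrieve("capital of France", passages)[0]
print(best_passage)
```

Many of the systems in the list below replace the word-count vectors with learned dense or sparse embeddings, but the query-in, ranked-text-out interface is the same.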

Papers

Showing 501-550 of 671 papers

Title | Status | Hype
Label Smoothing for Text Mining |  | 0
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning |  | 0
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval |  | 0
Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval |  | 0
Learning Context-Adapted Video-Text Retrieval by Attending to User Comments |  | 0
Learning Joint Visual Semantic Matching Embeddings for Language-guided Retrieval |  | 0
Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an Algorithm |  | 0
Learning to embed semantic similarity for joint image-text retrieval |  | 0
Learning with Noisy Correspondence |  | 0
Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos |  | 0
Leveraging Generative Language Models for Weakly Supervised Sentence Component Analysis in Video-Language Joint Learning |  | 0
Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships |  | 0
Lifelong learning for text retrieval and recognition in historical handwritten document collections |  | 0
LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models |  | 0
Linq-Embed-Mistral Technical Report |  | 0
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning |  | 0
Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models |  | 0
LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval |  | 0
LSTM-based Selective Dense Text Retrieval Guided by Sparse Lexical Retrieval |  | 0
LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival |  | 0
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders |  | 0
M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAP |  | 0
Mamba Retriever: Utilizing Mamba for Effective and Efficient Dense Retrieval |  | 0
MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning |  | 0
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval |  | 0
Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval |  | 0
MASS: Overcoming Language Bias in Image-Text Matching |  | 0
Maximal Matching Matters: Preventing Representation Collapse for Robust Cross-Modal Retrieval |  | 0
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval |  | 0
Med-gte-hybrid: A contextual embedding transformer model for extracting actionable information from clinical texts |  | 0
Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval |  | 0
MeSH-based dataset for measuring the relevance of text retrieval |  | 0
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval |  | 0
MIaS: Math-Aware Retrieval in Digital Mathematical Libraries |  | 0
MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction |  | 0
MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval |  | 0
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs |  | 0
MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning |  | 0
MSLKANet: A Multi-Scale Large Kernel Attention Network for Scene Text Removal |  | 0
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval |  | 0
Multi-Head Attention Driven Dynamic Visual-Semantic Embedding for Enhanced Image-Text Matching |  | 0
Multilateral Semantic Relations Modeling for Image Text Retrieval |  | 0
Multilingual Diversity Improves Vision-Language Representations |  | 0
Multimodal Learned Sparse Retrieval for Image Suggestion |  | 0
Multimodal Misinformation Detection using Large Vision-Language Models |  | 0
Multiscale Matching Driven by Cross-Modal Similarity Consistency for Audio-Text Retrieval |  | 0
Multi-Scale Temporal Difference Transformer for Video-Text Retrieval |  | 0
Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings |  | 0
Named Entity and Relation Extraction with Multi-Modal Retrieval |  | 0
NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality |  | 0
Page 11 of 14

No leaderboard results yet.