SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 301350 of 671 papers

TitleStatusHype
Establishing a Foundation for Tetun Ad-Hoc Text Retrieval: Stemming, Indexing, Retrieval, and Ranking0
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images0
Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses0
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning0
VladVA: Discriminative Fine-tuning of LVLMs0
Linq-Embed-Mistral Technical Report0
Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval0
DIR: Retrieval-Augmented Image Captioning with Comprehensive Understanding0
Approximate Fiber Product: A Preliminary Algebraic-Geometric Perspective on Multimodal Embedding Alignment0
CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectivesCode0
Knowledge Transfer Across Modalities with Natural Language Supervision0
Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval0
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training0
CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval0
A Comparative Study of Text Retrieval Models on DaReCzech0
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?0
Partial Scene Text RetrievalCode0
MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs0
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities0
Robotic State Recognition with Image-to-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization0
Multilingual Vision-Language Pre-training for the Remote Sensing DomainCode0
Do Audio-Language Models Understand Linguistic Variations?0
GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric LearningCode0
Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging0
Beyond Coarse-Grained Matching in Video-Text Retrieval0
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning0
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning0
AnyAttack: Towards Large-scale Self-supervised Adversarial Attacks on Vision-language Models0
CoLLAP: Contrastive Long-form Language-Audio Pretraining with Musical Temporal Structure Augmentation0
From Unimodal to Multimodal: Scaling up Projectors to Align ModalitiesCode0
Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization0
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval0
NEVLP: Noise-Robust Framework for Efficient Vision-Language Pre-training0
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG0
Pushing the Limits of Vision-Language Models in Remote Sensing without Human Annotations0
Benchmarking and Building Zero-Shot Hindi Retrieval Model with Hindi-BEIR and NLLB-E50
MODOC: A Modular Interface for Flexible Interlinking of Text Retrieval and Text Generation FunctionsCode0
Mistral-SPLADE: LLMs for better Learned Sparse RetrievalCode0
Improving embedding with contrastive fine-tuning on small datasets with expert-augmented scores0
NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality0
Mamba Retriever: Utilizing Mamba for Effective and Efficient Dense Retrieval0
Pairing Clustered Inverted Indexes with kNN Graphs for Fast Approximate Retrieval over Learned Sparse Representations0
Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation0
GABInsight: Exploring Gender-Activity Binding Bias in Vision-Language ModelsCode0
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysisCode0
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval0
Assessing Brittleness of Image-Text Retrieval Benchmarks from Vision-Language Models Perspective0
Multimodal Misinformation Detection using Large Vision-Language Models0
Object-Aware Query Perturbation for Cross-Modal Image-Text RetrievalCode0
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?0
Show:102550
← PrevPage 7 of 14Next →

No leaderboard results yet.