SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text).

Papers

Showing 51-100 of 671 papers

Title | Status | Hype
Med-gte-hybrid: A contextual embedding transformer model for extracting actionable information from clinical texts | - | 0
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features | Code | 0
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors | Code | 0
PeerQA: A Scientific Question Answering Dataset from Peer Reviews | Code | 1
LSTM-based Selective Dense Text Retrieval Guided by Sparse Lexical Retrieval | - | 0
Fine-tuning Multimodal Transformers on Edge: A Parallel Split Learning Approach | - | 0
Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding | Code | 3
DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions | - | 0
Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion | Code | 2
Expertized Caption Auto-Enhancement for Video-Text Retrieval | Code | 0
Scientometric Analysis of the German IR Community within TREC & CLEF | - | 0
Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes | Code | 0
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models | Code | 7
MASS: Overcoming Language Bias in Image-Text Matching | - | 0
TSVC:Tripartite Learning with Semantic Variation Consistency for Robust Image-Text Retrieval | - | 0
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature | Code | 2
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training | - | 0
V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts | - | 0
Retaining Knowledge and Enhancing Long-Text Representations in CLIP through Dual-Teacher Distillation | - | 0
CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR | - | 0
Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment | - | 0
CaReBench: A Fine-Grained Benchmark for Video Captioning and Retrieval | - | 0
The Text Classification Pipeline: Starting Shallow going Deeper | - | 0
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search | Code | 1
Optimizing Multi-Stage Language Models for Effective Text Retrieval | - | 0
Multi-Head Attention Driven Dynamic Visual-Semantic Embedding for Enhanced Image-Text Matching | - | 0
Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval | Code | 0
Where am I? Cross-View Geo-localization with Natural Language Descriptions | Code | 2
PolySmart @ TRECVid 2024 Medical Video Question Answering | - | 0
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval | - | 0
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering | Code | 0
I0T: Embedding Standardization Method Towards Zero Modality Gap | Code | 1
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval | Code | 1
Establishing a Foundation for Tetun Ad-Hoc Text Retrieval: Stemming, Indexing, Retrieval, and Ranking | - | 0
Gramian Multimodal Representation Learning and Alignment | Code | 2
jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images | - | 0
Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses | - | 0
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning | - | 0
VladVA: Discriminative Fine-tuning of LVLMs | - | 0
Linq-Embed-Mistral Technical Report | - | 0
Adaptive Two-Phase Finetuning LLMs for Japanese Legal Text Retrieval | - | 0
DIR: Retrieval-Augmented Image Captioning with Comprehensive Understanding | - | 0
Approximate Fiber Product: A Preliminary Algebraic-Geometric Perspective on Multimodal Embedding Alignment | - | 0
CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives | Code | 0
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models | Code | 2
Knowledge Transfer Across Modalities with Natural Language Supervision | - | 0
Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval | - | 0
Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training | - | 0
A Comparative Study of Text Retrieval Models on DaReCzech | - | 0
A Survey of Medical Vision-and-Language Applications and Their Techniques | Code | 1
Page 2 of 14

No leaderboard results yet.