SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 401450 of 671 papers

TitleStatusHype
CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval0
Filter & Align: Leveraging Human Knowledge to Curate Image-Text Data0
Computationally Efficient Labeling of Cancer Related Forum Posts by Non-Clinical Text Information Retrieval0
Constructing Image-Text Pair Dataset from Books0
Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval0
Constructing Phrase-level Semantic Labels to Form Multi-GrainedSupervision for Image-Text Retrieval0
Context-Aware Attention Network for Image-Text Retrieval0
Continual learning in cross-modal retrieval0
Contrastive Feature Masking Open-Vocabulary Vision Transformer0
Automated Cardiovascular Record Retrieval by Multimodal Learning between Electrocardiogram and Clinical Report0
CosmoCLIP: Generalizing Large Vision-Language Models for Astronomical Imaging0
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval0
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations0
Cross-modal Contrastive Learning for Speech Translation0
Cross-Modal Pre-Aligned Method with Global and Local Information for Remote-Sensing Image and Text Retrieval0
A New Fine-grained Alignment Method for Image-text Matching0
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning0
DART: Disease-aware Image-Text Alignment and Self-correcting Re-alignment for Trustworthy Radiology Report Generation0
DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions0
Deep Keyphrase Completion0
Deep Learning for Video-Text Retrieval: a Review0
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals0
Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection0
Denmark's Participation in the Search Engine TREC COVID-19 Challenge: Lessons Learned about Searching for Precise Biomedical Scientific Information on COVID-190
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval0
Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval0
DIR: Retrieval-Augmented Image Captioning with Comprehensive Understanding0
VladVA: Discriminative Fine-tuning of LVLMs0
Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation0
Distill CLIP (DCLIP): Enhancing Image-Text Retrieval via Cross-Modal Transformer Distillation0
DLIP: Distilling Language-Image Pre-training0
Do Audio-Language Models Understand Linguistic Variations?0
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents0
Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval0
Dual Relation Alignment for Composed Image Retrieval0
Dynamic Contrastive Distillation for Image-Text Retrieval0
EA-VTR: Event-Aware Video-Text Retrieval0
EfficientCLIP: Efficient Cross-Modal Pre-training by Ensemble Confident Learning and Language Modeling0
Efficient Image Captioning for Edge Devices0
Efficient Image-Text Retrieval via Keyword-Guided Pre-Screening0
Efficient Multilingual Multi-modal Pre-training through Triple Contrastive Loss0
Embedding-based Retrieval with LLM for Effective Agriculture Information Extracting from Unstructured Data0
End-to-End Autoregressive Retrieval via Bootstrapping for Smart Reply Systems0
Enhancing Knowledge Retrieval with In-Context Learning and Semantic Search through Generative AI0
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG0
Establishing a Foundation for Tetun Ad-Hoc Text Retrieval: Stemming, Indexing, Retrieval, and Ranking0
Evaluating D-MERIT of Partial-annotation on Information Retrieval0
Evaluation of Deep Gaussian Processes for Text Classification0
EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models0
EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE0
Show:102550
← PrevPage 9 of 14Next →

No leaderboard results yet.