SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most relevant text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text).
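As a rough illustration of the task definition, the sketch below ranks a few passages against a query using bag-of-words cosine similarity. This is a deliberately simple stand-in, not the method of any paper listed here; many of the listed works use learned dense embeddings instead. The function names (`tokenize`, `cosine`, `retrieve`) and the sample passages are hypothetical and chosen only for this example.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors (Counters)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, passages, top_k=1):
    """Return the top_k (score, passage) pairs, highest score first."""
    query_vec = Counter(tokenize(query))
    scored = [(cosine(query_vec, Counter(tokenize(p))), p) for p in passages]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Hypothetical toy corpus, used only to exercise the functions above.
passages = [
    "Paris is the capital of France.",
    "The mitochondria is the powerhouse of the cell.",
    "Dense retrievers encode queries and passages as vectors.",
]
best_score, best_passage = retrieve("What is the capital of France?", passages)[0]
print(best_passage)
```

Swapping the `Counter` vectors for embeddings from a trained encoder would keep the same rank-by-similarity structure while replacing exact word overlap with learned semantic similarity.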

Papers

Showing 401–450 of 671 papers

Title | Status | Hype
HGAN: Hierarchical Graph Alignment Network for Image-Text Retrieval | - | 0
Retrieval-based Disentangled Representation Learning with Natural Language Supervision | - | 0
Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift | Code | 1
FlexiViT: One Model for All Patch Sizes | Code | 1
Pre-trained Language Models Can be Fully Zero-Shot Learners | Code | 0
NLIP: Noise-robust Language-Image Pre-training | - | 0
Attentive Deep Neural Networks for Legal Document Retrieval | - | 0
Scale-Semantic Joint Decoupling Network for Image-text Retrieval in Remote Sensing | - | 0
DialogCC: An Automated Pipeline for Creating High-Quality Multi-Modal Dialogue Dataset | Code | 1
Named Entity and Relation Extraction with Multi-Modal Retrieval | - | 0
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval | - | 0
Dense Text Retrieval based on Pretrained Language Models: A Survey | Code | 2
ComCLIP: Training-Free Compositional Image and Text Matching | Code | 1
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning | Code | 1
MSLKANet: A Multi-Scale Large Kernel Attention Network for Scene Text Removal | - | 0
On Negative Sampling for Contrastive Audio-Text Retrieval | - | 0
Arabic Text Mining | - | 0
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval | - | 0
Exploring Train and Test-Time Augmentations for Audio-Language Learning | - | 0
Generative Negative Text Replay for Continual Vision-Language Pretraining | - | 0
COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning | Code | 1
RSVG: Exploring Data and Models for Visual Grounding on Remote Sensing Data | - | 0
Dissecting Deep Metric Learning Losses for Image-Text Retrieval | Code | 0
SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval | - | 0
An Analysis of Fusion Functions for Hybrid Retrieval | - | 0
Image-Text Retrieval with Binary and Continuous Label Supervision | - | 0
VTC: Improving Video-Text Retrieval with User Comments | Code | 1
MedCLIP: Contrastive Learning from Unpaired Medical Images and Text | Code | 2
Vision-Language Pre-training: Basics, Recent Advances, and Future Trends | Code | 3
MTEB: Massive Text Embedding Benchmark | Code | 4
Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQA | Code | 1
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model | Code | 1
MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning | - | 0
Learning to embed semantic similarity for joint image-text retrieval | - | 0
Nonparametric Decoding for Generative Retrieval | Code | 1
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model | Code | 1
Label Smoothing for Text Mining | - | 0
Efficient Multilingual Multi-modal Pre-training through Triple Contrastive Loss | - | 0
DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases | Code | 1
Re-Imagen: Retrieval-Augmented Text-to-Image Generator | - | 0
TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval | - | 0
Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text | Code | 1
Audio Retrieval with WavText5K and CLAP Training | Code | 1
Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval | - | 0
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval | - | 0
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks | - | 0
CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment | Code | 2
Unified Generative & Dense Retrieval for Query Rewriting in Sponsored Search | - | 0
VL-Taboo: An Analysis of Attribute-based Zero-shot Capabilities of Vision-Language Models | Code | 0
FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1
Page 9 of 14

No leaderboard results yet.