SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 451500 of 671 papers

TitleStatusHype
Design of the topology for contrastive visual-textual alignmentCode0
Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal RetrievalCode1
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical AlignmentCode1
Contrastive Audio-Language Learning for MusicCode1
Revising Image-Text Retrieval via Multi-Modal Entailment0
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval0
VLMAE: Vision-Language Masked Autoencoder0
On the Value of Behavioral Representations for Dense Retrieval0
Boosting Video-Text Retrieval with Explicit High-Level Semantics0
Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval0
X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text RetrievalCode1
Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text RetrieversCode4
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval0
Intra-Modal Constraint Loss For Image-Text RetrievalCode0
GazBy: Gaze-Based BERT Model to Incorporate Human Attention in Neural Information Retrieval0
Dynamic Contrastive Distillation for Image-Text Retrieval0
A Dense Representation Framework for Lexical and Semantic MatchingCode1
Towards Robust Ranker for Text Retrieval0
MixGen: A New Multi-Modal Data AugmentationCode1
Coarse-to-Fine Vision-Language Pre-training with Fusion in the BackboneCode1
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEsCode2
Egocentric Video-Language PretrainingCode2
VL-BEiT: Generative Vision-Language Pretraining0
Cross-lingual and Multilingual CLIPCode2
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-trainingCode1
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition0
Fast and Light-Weight Answer Text Retrieval in Dialogue SystemsCode1
Prompt-based Learning for Unpaired Image Captioning0
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connectionsCode1
HiVLP: Hierarchical Vision-Language Pre-Training for Fast Image-Text Retrieval0
HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer RerankingCode1
CCMB: A Large-scale Chinese Cross-modal BenchmarkCode1
Cross-modal Contrastive Learning for Speech TranslationCode1
Scene-Text Aware Image and Text Retrieval with Dual-Encoder0
TRAttack”:" Text Rewriting Attack Against Text Retrieval0
Generative Multi-hop RetrievalCode1
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text RetrievalCode1
Progressive Learning for Image Retrieval with Hybrid-Modality Queries0
MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENerationCode1
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval0
Robust Cross-Modal Representation Learning with Progressive Self-Distillation0
Socratic Models: Composing Zero-Shot Multimodal Reasoning with LanguageCode0
On Metric Learning for Audio-Text Cross-Modal RetrievalCode1
Image-text Retrieval: A Survey on Recent Research and Development0
Single-Stream Multi-Level Alignment for Vision-Language PretrainingCode0
Audio-text Retrieval in Context0
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding0
LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text RetrievalCode1
LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval0
An Uncommon Task: Participatory Design in Legal AI0
Show:102550
← PrevPage 10 of 14Next →

No leaderboard results yet.