SOTAVerified

Text Retrieval

Text Retrieval is the task of finding the most text result (such as an answer, paragraph, or passage) given a query (which could be a question, keywords, or any relevant text)

Papers

Showing 376400 of 671 papers

TitleStatusHype
UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal ModelingCode1
Differentiable Outlier Detection Enable Robust Deep Multimodal AnalysisCode0
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text RetrievalCode1
UPop: Unified and Progressive Pruning for Compressing Vision-Language TransformersCode1
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval0
STAIR: Learning Sparse Text and Image Representation in Grounded Tokens0
Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge TransferringCode1
MV-Adapter: Multimodal Video Transfer Learning for Video Text RetrievalCode1
USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text RetrievalCode0
HADA: A Graph-based Amalgamation Framework in Image-text RetrievalCode0
NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal EmbeddingsCode0
LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse RetrievalCode1
VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching0
HiVLP: Hierarchical Interactive Video-Language Pre-Training0
Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos0
Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval0
Multilateral Semantic Relations Modeling for Image Text Retrieval0
Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning NetworkCode1
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval0
Learning Semantic Relationship Among Instances for Image-Text MatchingCode1
GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream tasks0
When are Lemons Purple? The Concept Association Bias of Vision-Language Models0
Multi-modal Molecule Structure-text Model for Text-based Retrieval and EditingCode2
Efficient Image Captioning for Edge Devices0
AugTriever: Unsupervised Dense Retrieval and Domain Adaptation by Scalable Data AugmentationCode0
Show:102550
← PrevPage 16 of 27Next →

No leaderboard results yet.