SOTAVerified

Video-Text Retrieval

Video-Text retrieval requires understanding of both video and language together. Therefore it's different to video retrieval task.

Papers

Showing 7180 of 111 papers

TitleStatusHype
HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models0
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval0
Video Editing for Video Retrieval0
Leveraging Generative Language Models for Weakly Supervised Sentence Component Analysis in Video-Language Joint Learning0
Harvest Video Foundation Models via Efficient Post-Pretraining0
Videoprompter: an ensemble of foundational models for zero-shot video understanding0
Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval0
Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval0
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible AdapterCode0
VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending0
Show:102550
← PrevPage 8 of 12Next →

No leaderboard results yet.