SOTAVerified

Video-Text Retrieval

Video-Text retrieval requires understanding of both video and language together. Therefore it's different to video retrieval task.

Papers

Showing 6170 of 111 papers

TitleStatusHype
Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval0
EA-VTR: Event-Aware Video-Text Retrieval0
Exploiting Visual Semantic Reasoning for Video-Text Retrieval0
CaReBench: A Fine-Grained Benchmark for Video Captioning and Retrieval0
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition0
Harvest Video Foundation Models via Efficient Post-Pretraining0
HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models0
HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model0
HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval0
HiVLP: Hierarchical Interactive Video-Language Pre-Training0
Show:102550
← PrevPage 7 of 12Next →

No leaderboard results yet.