SOTAVerified

Video-Text Retrieval

Video-Text retrieval requires understanding of both video and language together. Therefore it's different to video retrieval task.

Papers

Showing 7180 of 111 papers

TitleStatusHype
HiVLP: Hierarchical Interactive Video-Language Pre-Training0
Beyond Coarse-Grained Matching in Video-Text Retrieval0
HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval0
Stacked Convolutional Deep Encoding Network for Video-Text Retrieval0
HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model0
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding0
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding0
An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval0
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval0
HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models0
Show:102550
← PrevPage 8 of 12Next →

No leaderboard results yet.