SOTAVerified

Video-Text Retrieval

Video-text retrieval requires a joint understanding of video and language, which distinguishes it from the video retrieval task.

Papers

Showing 51-60 of 111 papers

Title | Status | Hype
Video Editing for Video Retrieval | | 0
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception | | 0
An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval | | 0
Beyond Coarse-Grained Matching in Video-Text Retrieval | | 0
Boosting Video-Text Retrieval with Explicit High-Level Semantics | | 0
CLIP2TV: Align, Match and Distill for Video-Text Retrieval | | 0
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations | | 0
Deep Learning for Video-Text Retrieval: a Review | | 0
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals | | 0
Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval | | 0

No leaderboard results yet.