SOTAVerified

Video-Text Retrieval

Video-Text retrieval requires understanding of both video and language together. Therefore it's different to video retrieval task.

Papers

Showing 91100 of 111 papers

TitleStatusHype
EA-VTR: Event-Aware Video-Text Retrieval0
Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval0
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval0
Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval0
Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals0
V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts0
V^2Dial: Unification of Video and Visual Dialog via Multimodal Experts0
Video Editing for Video Retrieval0
Deep Learning for Video-Text Retrieval: a Review0
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations0
Show:102550
← PrevPage 10 of 12Next →

No leaderboard results yet.