SOTAVerified

Video-Text Retrieval

Video-text retrieval requires joint understanding of video and language, which makes it a different task from video retrieval.

Papers

Showing 71-80 of 111 papers

Title | Status | Hype
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval | | 0
Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval | | 0
Learning Context-Adapted Video-Text Retrieval by Attending to User Comments | | 0
Learning with Noisy Correspondence | | 0
Leveraging Generative Language Models for Weakly Supervised Sentence Component Analysis in Video-Language Joint Learning | | 0
LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders | | 0
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval | | 0
Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval | | 0
Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval | | 0
Multi-Scale Temporal Difference Transformer for Video-Text Retrieval | | 0

No leaderboard results yet.