SOTAVerified

Video-Text Retrieval

Video-Text retrieval requires understanding of both video and language together. Therefore it's different to video retrieval task.

Papers

Showing 76100 of 111 papers

TitleStatusHype
Videoprompter: an ensemble of foundational models for zero-shot video understanding0
Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval0
Uncertainty-Aware Alignment Network for Cross-Domain Video-Text Retrieval0
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible AdapterCode0
VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending0
Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval0
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception0
CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive LearningCode0
Deep Learning for Video-Text Retrieval: a Review0
Video-Text Retrieval by Supervised Sparse Multi-Grained LearningCode0
Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval0
Dual Alignment Unsupervised Domain Adaptation for Video-Text Retrieval0
HiVLP: Hierarchical Interactive Video-Language Pre-Training0
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval0
Masked Contrastive Pre-Training for Efficient Video-Text Retrieval0
TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval0
Unified Loss of Pair Similarity Optimization for Vision-Language Retrieval0
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval0
OmniVL:One Foundation Model for Image-Language and Video-Language Tasks0
Boosting Video-Text Retrieval with Explicit High-Level Semantics0
LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval0
Generalizing Multimodal Pre-training into Multilingual via Language Acquisition0
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding0
Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding0
CLIP2TV: Align, Match and Distill for Video-Text Retrieval0
Show:102550
← PrevPage 4 of 5Next →

No leaderboard results yet.