SOTAVerified

Video to Text Retrieval

Papers

Showing 110 of 13 papers

TitleStatusHype
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities0
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language BenchmarkCode1
SignCLIP: Connecting Text and Sign Language by Contrastive LearningCode1
Sakuga-42M Dataset: Scaling Up Cartoon Research0
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal RetrievalCode1
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in IndonesianCode0
i-Code Studio: A Configurable and Composable Framework for Integrative AI0
VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners0
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text RetrievalCode1
Socratic Models: Composing Zero-Shot Multimodal Reasoning with LanguageCode0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.