SOTAVerified

Video to Text Retrieval

Papers

Showing 1-13 of 13 papers

Title | Status | Hype
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval | Code | 1
Bridging Video-text Retrieval with Multiple Choice Questions | Code | 1
CLIP2Video: Mastering Video-Text Retrieval via Image CLIP | Code | 1
SignCLIP: Connecting Text and Sign Language by Contrastive Learning | Code | 1
Learning a Text-Video Embedding from Incomplete and Heterogeneous Data | Code | 1
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval | Code | 1
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark | Code | 1
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian | Code | 0
Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language | Code | 0
i-Code Studio: A Configurable and Composable Framework for Integrative AI | | 0
VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners | | 0
SPECTRUM: Semantic Processing and Emotion-informed video-Captioning Through Retrieval and Understanding Modalities | | 0
Sakuga-42M Dataset: Scaling Up Cartoon Research | | 0

No leaderboard results yet.