SOTAVerified

Text to Video Retrieval

She's gone. I can't find her anywhere. I'm looking everywhere for her. Everywhere is dark.

Papers

Showing 61-70 of 75 papers

Title | Status | Hype
Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval | | 0
An Empirical Study of Frame Selection for Text-to-Video Retrieval | | 0
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer | | 0
MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian | Code | 0
Semantic Role Aware Correlation Transformer for Text to Video Retrieval | Code | 0
TC-MGC: Text-Conditioned Multi-Grained Contrastive Learning for Text-Video Retrieval | Code | 0
Robustness Analysis of Video-Language Models Against Visual and Language Perturbations | Code | 0
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning | Code | 0
ContextIQ: A Multimodal Expert-Based Video Retrieval System for Contextual Advertising | Code | 0
Efficient End-to-End Video Question Answering with Pyramidal Multimodal Transformer | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | FROZEN-revised | mAP | 23.39 | | Unverified
2 | FROZEN-revised (two-stream) | text-to-video R@1 | 12.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CLIP4Clip | text-to-video R@1 | 44.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | X-CLIP (Cross-Lingual) | R@1 | 32.3 | | Unverified