SOTAVerified

Long Video Retrieval (Background Removed)

Retrieve long videos given all of their subtitles.

Papers

Showing 1-6 of 6 papers

| Title | Status | Hype |
|---|---|---|
| Multi-granularity Correspondence Learning from Long-term Noisy Videos | Code | 2 |
| TempCLR: Temporal Alignment Representation with Contrastive Learning | Code | 1 |
| VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding | Code | 0 |
| Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos | Code | 1 |
| End-to-End Learning of Visual Representations from Uncurated Instructional Videos | Code | 1 |
| HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips | Code | 1 |

Benchmark Results

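| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Norton | Cap. Avg. R@1 | 75.5 | | Unverified |
| 2 | TempCLR | Cap. Avg. R@1 | 74.5 | | Unverified |
| 3 | VideoCLIP | Cap. Avg. R@1 | 74.5 | | Unverified |
| 4 | MCN | Cap. Avg. R@1 | 53.4 | | Unverified |
| 5 | Text-Video Embedding | Cap. Avg. R@1 | 46.6 | | Unverified |
| 6 | MIL-NCE | Cap. Avg. R@1 | 43.1 | | Unverified |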
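
The benchmark results are reported as R@1 (recall at rank 1). As a rough illustration only, the sketch below computes recall at k from a query-by-video similarity matrix. It assumes that row i holds the scores of query i (for example, one video's subtitles) against every candidate video, and that the correct video for query i sits at index i. The function name `recall_at_k` and the toy scores are hypothetical. The sketch does not reproduce the "Cap. Avg." matching step used by the listed papers.

```python
from typing import Sequence


def recall_at_k(similarity: Sequence[Sequence[float]], k: int = 1) -> float:
    """Fraction of queries whose ground-truth video ranks in the top k.

    Assumption (not from the source page): similarity[i][j] is the score of
    query i against video j, and video i is the correct match for query i.
    """
    hits = 0
    for i, row in enumerate(similarity):
        # Rank video indices from highest to lowest score for this query.
        ranked = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        if i in ranked[:k]:
            hits += 1
    return hits / len(similarity)


# Hypothetical toy scores: queries 0 and 2 rank their own video first,
# query 1 ranks video 2 above its own video.
scores = [
    [0.9, 0.1, 0.0],
    [0.2, 0.3, 0.8],
    [0.1, 0.2, 0.7],
]

r_at_1 = recall_at_k(scores, k=1)
r_at_2 = recall_at_k(scores, k=2)
print(f"R@1 = {100 * r_at_1:.1f}, R@2 = {100 * r_at_2:.1f}")
```

The printed values are scaled by 100 on the assumption that the table reports percentages.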