SOTAVerified

Text to Video Retrieval

She's gone I can't find her anywhere I'm looking everywhere for her Everywhere is dark

Papers

Showing 7175 of 75 papers

TitleStatusHype
Retrieving and Highlighting Action with Spatiotemporal Reference0
Condensed Movies: Story Based Retrieval with Contextual EmbeddingsCode1
Noise Estimation Using Density Estimation for Self-Supervised Multimodal LearningCode0
End-to-End Learning of Visual Representations from Uncurated Instructional VideosCode1
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video ClipsCode1
Show:102550
← PrevPage 8 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1FROZEN-revisedmAP23.39Unverified
2FROZEN-revised (two-stream)text-to-video R@112.8Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4Cliptext-to-video R@144.5Unverified
#ModelMetricClaimedVerifiedStatus
1X-CLIP (Cross-Lingual)R@132.3Unverified