SOTAVerified

Video Retrieval

The objective of video retrieval is as follows: given a text query and a pool of candidate videos, select the video which corresponds to the text query. Typically, the videos are returned as a ranked list of candidates and scored via document retrieval metrics.

Papers

Showing 110 of 486 papers

TitleStatusHype
MAGMaR Shared Task System Description: Video Retrieval with OmniEmbed0
Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval0
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained VideosCode0
Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review0
Learning World Models for Interactive Video Generation0
A Challenge to Build Neuro-Symbolic Video AgentsCode0
LoVR: A Benchmark for Long Video Retrieval in Multimodal ContextsCode1
Contrastive Alignment with Semantic Gap-Aware Corrections in Text-Video RetrievalCode0
Video-GPT via Next Clip DiffusionCode1
CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture0
Show:102550
← PrevPage 1 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternVideo2-6Btext-to-video R@174.1Unverified
2VASTtext-to-video R@170.5Unverified
3VALORtext-to-video R@170.1Unverified
4GRAMtext-to-video R@169.9Unverified
5COSAtext-to-video R@167.3Unverified
6UMT-L (ViT-L/16)text-to-video R@166.8Unverified
7vid-TLDR (UMT-L)text-to-video R@166.7Unverified
8InternVideotext-to-video R@162.2Unverified
9CLIP-ViPtext-to-video R@161.4Unverified
10GRAMtext-to-video R@159Unverified