SOTAVerified

Video Retrieval

The objective of video retrieval is as follows: given a text query and a pool of candidate videos, select the video which corresponds to the text query. Typically, the videos are returned as a ranked list of candidates and scored via document retrieval metrics.

Papers

Showing 110 of 486 papers

TitleStatusHype
MAGMaR Shared Task System Description: Video Retrieval with OmniEmbed0
Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval0
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained VideosCode0
Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review0
Learning World Models for Interactive Video Generation0
A Challenge to Build Neuro-Symbolic Video AgentsCode0
LoVR: A Benchmark for Long Video Retrieval in Multimodal ContextsCode1
Contrastive Alignment with Semantic Gap-Aware Corrections in Text-Video RetrievalCode0
Video-GPT via Next Clip DiffusionCode1
CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture0
Show:102550
← PrevPage 1 of 49Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1InternVideo2-6Btext-to-video R@161.4Unverified
2InternVideo2-6Btext-to-video R@159.3Unverified
3HunYuan_tvr (huge)text-to-video R@159Unverified
4InternVideotext-to-video R@158.4Unverified
5HunYuan_tvrtext-to-video R@158.2Unverified
6vid-TLDR (UMT-L)text-to-video R@157.9Unverified
7VLABtext-to-video R@157.5Unverified
8MDMMT-2text-to-video R@156.8Unverified
9Side4Videotext-to-video R@156.1Unverified
10Cap4Videotext-to-video R@151.8Unverified