SOTAVerified

Video Retrieval

Video retrieval takes a text query and a pool of candidate videos and selects the video that matches the query. Systems typically return the candidates as a ranked list, which is scored with document retrieval metrics such as recall at K (R@K).
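As an illustration of the ranking step, here is a minimal sketch that assumes the query and each video have already been mapped to vectors in a shared embedding space, and that cosine similarity is the scoring function. Both are assumptions made for the example, not something this page specifies. The helper names `cosine` and `rank_videos` and the toy embeddings are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_videos(query_vec, video_vecs):
    """Return video ids sorted from most to least similar to the query."""
    scores = {vid: cosine(query_vec, vec) for vid, vec in video_vecs.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Toy embeddings (hypothetical); a real system would obtain these from a trained model.
query = [1.0, 0.0]
pool = {
    "cooking": [0.9, 0.1],
    "soccer": [0.0, 1.0],
    "baking": [0.6, 0.8],
}

ranking = rank_videos(query, pool)
print(ranking)  # ['cooking', 'baking', 'soccer']
```

The first element of `ranking` is the retrieved video; the full list is what a retrieval metric would score.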

Papers

Showing 1-10 of 486 papers

Title | Status | Hype
MAGMaR Shared Task System Description: Video Retrieval with OmniEmbed | - | 0
Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval | - | 0
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos | Code | 0
Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review | - | 0
Learning World Models for Interactive Video Generation | - | 0
A Challenge to Build Neuro-Symbolic Video Agents | Code | 0
LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts | Code | 1
Contrastive Alignment with Semantic Gap-Aware Corrections in Text-Video Retrieval | Code | 0
Video-GPT via Next Clip Diffusion | Code | 1
CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | OmniVec | text-to-video R@10 | 70.8 | - | Unverified
2 | OmniVec (pretrained) | text-to-video R@10 | 64.2 | - | Unverified
3 | VAST | text-to-video R@1 | 50.4 | - | Unverified
4 | UniVL + MELTR | text-to-video R@1 | 33.7 | - | Unverified
5 | VideoCLIP | text-to-video R@1 | 32.2 | - | Unverified
6 | MDMMT-2 | text-to-video R@1 | 32 | - | Unverified
7 | TACo | text-to-video R@1 | 29.6 | - | Unverified
8 | UniVL | text-to-video R@1 | 28.9 | - | Unverified
9 | VLM | text-to-video R@1 | 27.05 | - | Unverified
10 | VideoCLIP | text-to-video R@1 | 22.7 | - | Unverified
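
The R@1 and R@10 metrics listed above are recall at K: the share of text queries whose correct video appears among the top K ranked candidates. The sketch below shows one common way to compute it, assuming each query has exactly one correct video and that the score is reported as a percentage. Both are assumptions for the example, and the function name `recall_at_k` and the toy data are hypothetical.

```python
def recall_at_k(rankings, ground_truth, k):
    """Percentage of queries whose correct video is within the top k results."""
    hits = sum(
        1 for query, ranked in rankings.items()
        if ground_truth[query] in ranked[:k]
    )
    return 100.0 * hits / len(rankings)


# Toy ranked lists per query (hypothetical data, best match first).
rankings = {
    "q1": ["v1", "v2", "v3"],
    "q2": ["v3", "v2", "v1"],
    "q3": ["v2", "v3", "v1"],
    "q4": ["v1", "v3", "v2"],
}
# The single correct video for each query (assumed one-to-one).
truth = {"q1": "v1", "q2": "v2", "q3": "v1", "q4": "v3"}

r1 = recall_at_k(rankings, truth, 1)
r2 = recall_at_k(rankings, truth, 2)
print(r1, r2)  # 25.0 75.0
```

Because a larger K can only add hits, R@10 is never lower than R@1 for the same model and dataset, so values for the two metrics are not directly comparable.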