SOTAVerified

Video Retrieval

The objective of video retrieval is as follows: given a text query and a pool of candidate videos, select the video which corresponds to the text query. Typically, the videos are returned as a ranked list of candidates and scored via document retrieval metrics.

Papers

Showing 451486 of 486 papers

TitleStatusHype
Video retrieval based on deep convolutional neural network0
Unsupervised Segmentation of Action Segments in Egocentric Videos using Gaze0
Learning from Video and Text via Large-Scale Discriminative ClusteringCode0
An Improved Video Analysis using Context based Extension of LSH0
Unified Embedding and Metric Learning for Zero-Exemplar Event Detection0
Dense-Captioning Events in VideosCode1
Efficient Action Detection in Untrimmed Videos via Multi-Task Learning0
Binary Subspace Coding for Query-by-Image Video Retrieval0
Real-time analysis of cataract surgery videos using statistical models0
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering0
Learning Language-Visual Embedding for Movie Understanding with Natural-Language0
Sharing Hash Codes for Multiple Purposes0
Learning Joint Representations of Videos and Sentences with Web Image Search0
Large-Scale Query-by-Image Video Retrieval Using Bloom Filters0
De-Hashing: Server-Side Context-Aware Feature Reconstruction for Mobile Visual Search0
Strategies for Searching Video Content with Text Queries or Video Examples0
Ego-Surfing: Person Localization in First-Person Videos Using Ego-Motion Signatures0
Deep Learning Based Semantic Video Indexing and Retrieval0
VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products0
Semantic Video Entity Linking Based on Visual Content and Metadata0
Multimodal Skip-gram Using Convolutional Pseudowords0
Circulant temporal encoding for video retrieval and temporal alignmentCode0
Face Video Retrieval With Image Query via Hashing Across Euclidean Space and Riemannian Manifold0
Bag of Genres for Video Retrieval0
Visual Information Retrieval in Endoscopic Video Archives0
Discrete Wavelet Transform and Gradient Difference based approach for text localization in videos0
Advances in Human Action Recognition: A Survey0
A Faster Method for Tracking and Scoring Videos Corresponding to Sentences0
Analysis of Gait Pattern to Recognize the Human Activities0
Visual Semantic Search: Retrieving Videos via Complex Textual Queries0
KPCA Spatio-temporal trajectory point cloud classifier for recognizing human actions in a CBVR system0
Classroom Video Assessment and Retrieval via Multiple Instance Learning0
System Analysis And Design For Multimedia Retrieval Systems0
Multimodal Approach for Video Surveillance Indexing and Retrieval0
Learning Locally-Adaptive Decision Functions for Person Verification0
Two-person interaction detection using body-pose features and multiple instance learning0
Show:102550
← PrevPage 10 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniVectext-to-video R@1089.4Unverified
2CLIP4Cliptext-to-video R@1081.6Unverified
3OmniVec (pretrained)text-to-video R@1078.6Unverified
4HunYuan_tvr (huge)text-to-video R@162.9Unverified
5CLIP-ViPtext-to-video R@157.7Unverified
6PIDRotext-to-video R@155.9Unverified
7DMAE (ViT-B/16)text-to-video R@155.5Unverified
8HunYuan_tvrtext-to-video R@155Unverified
9MuLTItext-to-video R@154.7Unverified
10EERCFtext-to-video R@154.1Unverified
#ModelMetricClaimedVerifiedStatus
1Aurora (ours, r=64)text-to-video R@577.4Unverified
2InternVideo2-6Btext-to-video R@174.2Unverified
3vid-TLDR (UMT-L)text-to-video R@172.3Unverified
4VASTtext-to-video R@172Unverified
5COSAtext-to-video R@170.5Unverified
6UMT-L (ViT-L/16)text-to-video R@170.4Unverified
7GRAMtext-to-video R@167.3Unverified
8VALORtext-to-video R@161.5Unverified
9TESTA (ViT-B/16)text-to-video R@161.2Unverified
10VindLUtext-to-video R@161.2Unverified
#ModelMetricClaimedVerifiedStatus
1GRAMtext-to-video R@164Unverified
2VASTtext-to-video R@163.9Unverified
3InternVideo2-6Btext-to-video R@162.8Unverified
4VALORtext-to-video R@159.9Unverified
5UMT-L (ViT-L/16)text-to-video R@158.8Unverified
6vid-TLDR (UMT-L)text-to-video R@158.1Unverified
7COSAtext-to-video R@157.9Unverified
8InternVideo2-6Btext-to-video R@155.9Unverified
9InternVideotext-to-video R@155.2Unverified
10VLABtext-to-video R@155.1Unverified
#ModelMetricClaimedVerifiedStatus
1EMCL-Net (Ours)++ LSMDC Rohrbach et al. (2015)text-to-video R@1053.7Unverified
2InternVideo2-6Btext-to-video R@146.4Unverified
3vid-TLDR (UMT-L)text-to-video R@143.1Unverified
4UMT-L (ViT-L/16)text-to-video R@143Unverified
5HunYuan_tvr (huge)text-to-video R@140.4Unverified
6COSAtext-to-video R@139.4Unverified
7mPLUG-2text-to-video R@134.4Unverified
8VALORtext-to-video R@134.2Unverified
9InternVideotext-to-video R@134Unverified
10InternVideo2-6Btext-to-video R@133.8Unverified