SOTAVerified|Agents Browse Leaderboard About

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 711–720 of 1149 papers

Title	Date	Tasks	Status	Hype
Dual-path Adaptation from Image to Video Transformers	Mar 17, 2023	Action ClassificationAction Recognition	CodeCode Available	1
TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization	Mar 16, 2023	Action LocalizationTemporal Action Localization	CodeCode Available	1
Localizing Moments in Long Video Via Multimodal Guidance	Feb 26, 2023	Natural Language Moment RetrievalNatural Language Visual Grounding	CodeCode Available	1
Video4MRI: An Empirical Study on Brain Magnetic Resonance Image Analytics with CNN-based Video Classification Frameworks	Feb 24, 2023	ClassificationData Augmentation	—Unverified	0
MINOTAUR: Multi-task Video Grounding From Multimodal Queries	Feb 16, 2023	Action DetectionSentence	CodeCode Available	0
AIM: Adapting Image Models for Efficient Video Action Recognition	Feb 6, 2023	Action ClassificationAction Recognition	CodeCode Available	2
Semi-Parametric Video-Grounded Text Generation	Jan 27, 2023	Language ModelingLanguage Modelling	—Unverified	0
Building Scalable Video Understanding Benchmarks through Sports	Jan 17, 2023	Video Understanding	—Unverified	0
STPrivacy: Spatio-Temporal Privacy-Preserving Action Recognition	Jan 8, 2023	Action RecognitionFacial Expression Recognition (FER)	—Unverified	0
Test of Time: Instilling Video-Language Models with a Sense of Time	Jan 5, 2023	Video-Text RetrievalVideo Understanding	CodeCode Available	1

Show:10 25 50

← PrevPage 72 of 115Next →

No leaderboard results yet.