SOTAVerified|Agents Browse Leaderboard About

Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 561–570 of 1149 papers

Title	Date	Tasks	Status	Hype
GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning	Jun 19, 2025	Multimodal Reasoningreinforcement-learning	—Unverified	0
Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection	Apr 20, 2025	Action DetectionDecoder	—Unverified	0
Cultivating DNN Diversity for Large Scale Video Labelling	Jul 13, 2017	DiversityVideo Understanding	—Unverified	0
Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation	Mar 30, 2021	Action DetectionTemporal Action Proposal Generation	—Unverified	0
Grounding Action Descriptions in Videos	Jan 1, 2013	Semantic Textual SimilarityVideo Understanding	—Unverified	0
Grounded Video Situation Recognition	Oct 19, 2022	DescriptiveStructured Prediction	—Unverified	0
CTM: Collaborative Temporal Modeling for Action Recognition	Feb 8, 2020	Action RecognitionVideo Understanding	—Unverified	0
CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding	Jan 17, 2024	Contrastive Learningpoint cloud video understanding	—Unverified	0
Audio-visual training for improved grounding in video-text LLMs	Jul 21, 2024	Video Understanding	—Unverified	0
Motion Sensitive Contrastive Learning for Self-supervised Video Representation	Aug 12, 2022	Contrastive LearningRepresentation Learning	—Unverified	0

Show:10 25 50

← PrevPage 57 of 115Next →

No leaderboard results yet.