Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Showing 991–1000 of 1149 papers

Title	Date	Tasks	Status	Hype
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training	Jul 5, 2020	Decoder, Question Answering	Unverified	0
Video Moment Localization using Object Evidence and Reverse Captioning	Jun 18, 2020	Language-Based Temporal Localization, Language Modelling	Code Available	1
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization	Jun 14, 2020	Action Detection, Action Localization	Code Available	1
Video Understanding as Machine Translation	Jun 12, 2020	Machine Translation, Metric Learning	Unverified	0
Large Scale Video Representation Learning via Relational Graph Clustering	Jun 1, 2020	Clustering, Graph Clustering	Unverified	0
Screencast Tutorial Video Understanding	Jun 1, 2020	object-detection, Object Detection	Code Available	0
Temporal Aggregate Representations for Long-Range Video Understanding	Jun 1, 2020	Action Anticipation, Action Recognition	Code Available	1
CARPe Posterum: A Convolutional Approach for Real-time Pedestrian Path Prediction	May 26, 2020	Autonomous Vehicles, Prediction	Code Available	0
DramaQA: Character-Centered Video Story Understanding with Hierarchical QA	May 7, 2020	Question Answering, Video Question Answering	Code Available	0
CATER: A diagnostic dataset for Compositional Actions & TEmporal Reasoning	May 1, 2020	Diagnostic, Object	Unverified	0
