Video Understanding

A crucial task of Video Understanding is to recognise and localise (in space and time) different actions or events appearing in the video.

Source: Action Detection from a Robot-Car Perspective

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1001–1025 of 1149 papers

Title	Date	Tasks	Status
Video Action Understanding	Oct 13, 2020	Action UnderstandingDeep Learning	CodeCode Available
Global Self-Attention Networks for Image Recognition	Oct 6, 2020	Video Understanding	—Unverified
Features Understanding in 3D CNNs for Actions Recognition in Video	Oct 1, 2020	Action RecognitionDecision Making	CodeCode Available
Residual Frames with Efficient Pseudo-3D CNN for Human Action Recognition	Aug 3, 2020	Action RecognitionOptical Flow Estimation	—Unverified
Self-supervised Motion Representation via Scattering Local Motion Cues	Aug 1, 2020	Action RecognitionOptical Flow Estimation	—Unverified
Detection and Localization of Robotic Tools in Robot-Assisted Surgery Videos Using Deep Neural Networks for Region Proposal and Detection	Jul 29, 2020	object-detectionObject Detection	—Unverified
Perceptron Synthesis Network: Rethinking the Action Scale Variances in Videos	Jul 22, 2020	Action RecognitionTemporal Action Localization	—Unverified
MovieNet: A Holistic Dataset for Movie Understanding	Jul 21, 2020	Video Understanding	—Unverified
Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training	Jul 5, 2020	DecoderQuestion Answering	—Unverified
Video Understanding as Machine Translation	Jun 12, 2020	Machine TranslationMetric Learning	—Unverified
Screencast Tutorial Video Understanding	Jun 1, 2020	object-detectionObject Detection	CodeCode Available
Large Scale Video Representation Learning via Relational Graph Clustering	Jun 1, 2020	ClusteringGraph Clustering	—Unverified
CARPe Posterum: A Convolutional Approach for Real-time Pedestrian Path Prediction	May 26, 2020	Autonomous VehiclesPrediction	CodeCode Available
DramaQA: Character-Centered Video Story Understanding with Hierarchical QA	May 7, 2020	Question AnsweringVideo Question Answering	CodeCode Available
HLVU : A New Challenge to Test Deep Understanding of Movies the Way Humans do	May 1, 2020	Video Understanding	—Unverified
CATER: A diagnostic dataset for Compositional Actions & TEmporal Reasoning	May 1, 2020	DiagnosticObject	—Unverified
Beyond Instructional Videos: Probing for More Diverse Visual-Textual Grounding on YouTube	Apr 29, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
DriftNet: Aggressive Driving Behavior Classification using 3D EfficientNet Architecture	Apr 18, 2020	Anomaly DetectionClassification	CodeCode Available
Knowledge-Based Visual Question Answering in Videos	Apr 17, 2020	Question AnsweringVideo Question Answering	—Unverified
Real-Time Segmentation Networks should be Latency Aware	Apr 6, 2020	Autonomous VehiclesScene Segmentation	—Unverified
Context Modulated Dynamic Networks for Actor and Action Video Segmentation with Language Queries	Apr 3, 2020	Referring Expression SegmentationVideo Segmentation	—Unverified
Fully Automated Hand Hygiene Monitoring\ Operating Room using 3D Convolutional Neural Network	Mar 20, 2020	Optical Flow EstimationTransfer Learning	—Unverified
Beyond the Camera: Neural Networks in World Coordinates	Mar 12, 2020	Action RecognitionVideo Stabilization	—Unverified
CTM: Collaborative Temporal Modeling for Action Recognition	Feb 8, 2020	Action RecognitionVideo Understanding	—Unverified
Cut-Based Graph Learning Networks to Discover Compositional Structure of Sequential Video Data	Jan 17, 2020	Graph LearningVideo Understanding	—Unverified

Show:10 25 50

← PrevPage 41 of 46Next →

No leaderboard results yet.