SOTAVerified

Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Showing 751800 of 817 papers

TitleStatusHype
R-C3D: Region Convolutional 3D Network for Temporal Activity DetectionCode0
A Pursuit of Temporal Accuracy in General Activity DetectionCode0
Efficient Action Detection in Untrimmed Videos via Multi-Task Learning0
Temporal Tessellation: A Unified Approach for Video AnalysisCode0
Temporal-Needle: A view and appearance invariant video descriptor0
Unsupervised Human Action Detection by Action Matching0
Video Event Detection by Exploiting Word Dependencies from Image Captions0
An End-to-End Architecture for Keyword Spotting and Voice Activity DetectionCode1
Learning recurrent representations for hierarchical behavior modeling0
Real-time Online Action Detection Forests using Spatio-temporal Contexts0
Review of Action Recognition and Detection MethodsCode0
Multi-region two-stream R-CNN for action detection0
Temporal Activity Detection in Untrimmed Videos with Recurrent Neural NetworksCode0
Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos0
Efficient Activity Detection in Untrimmed Video with Max-Subgraph Search0
Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection0
Untrimmed Video Classification for Activity Detection: submission to ActivityNet ChallengeCode0
Aggressive actions and anger detection from multiple modalities using Kinect0
Hand Action Detection from Ego-centric Depth Sequences with Error-correcting Hough Transform0
A Multi-Stream Bi-Directional Recurrent Neural Network for Fine-Grained Action Detection0
Progressively Parsing Interactional Objects for Fine Grained Action Detection0
Learning Activity Progression in LSTMs for Activity Detection and Early Detection0
Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos0
Temporal Action Detection Using a Statistical Language ModelCode0
Actionness Estimation Using Hybrid Fully Convolutional Networks0
Online Action Detection0
Online Human Action Detection using Joint Classification-Regression Recurrent Neural NetworksCode0
Kernel-based Sensor Fusion with Application to Audio-Visual Voice Activity Detection0
Leaving Some Stones Unturned: Dynamic Feature Prioritization for Activity Detection in Streaming Video0
Cross-modal Supervision for Learning Active Speaker Detection in Video0
Fast Optical Flow using Dense Inverse Search0
Action Detection by Implicit Intentional Motion Clustering0
Actionness-Assisted Recognition of Actions0
End-to-end Learning of Action Detection from Frame Glimpses in VideosCode0
Application of Machine Learning Techniques in Human Activity Recognition0
A Novel Approach for Human Action Recognition from Silhouette Images0
Tensor vs Matrix Methods: Robust Tensor Decomposition under Block Sparse Perturbations0
Continuous control with deep reinforcement learningCode1
The Cohort and Speechify Libraries for Rapid Construction of Speech Enabled Applications for Android0
User Adaptive Restoration for Incorrectly-Segmented Utterances in Spoken Dialogue Systems0
Online Anomaly Detection via Class-Imbalance Learning0
Fast Action Proposals for Human Action Detection and Search0
Encoding Based Saliency Detection for Videos and Images0
ActivityNet: A Large-Scale Video Benchmark for Human Activity UnderstandingCode0
Group Event Detection with a Varying Number of Group Members for Video Surveillance0
A new network-based algorithm for human activity recognition in video0
Linear-time Online Action Detection From 3D Skeletal Data Using Bags of Gesturelets0
A Survey on Recent Advances of Computer Vision Algorithms for Egocentric Video0
Voice Activity Detection using Temporal Characteristics of Autocorrelation Lag and Maximum Spectral Amplitude in Sub-bands0
Multiple Instance Reinforcement Learning for Efficient Weakly-Supervised Detection in Images0
Show:102550
← PrevPage 16 of 17Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1STAR/LFrame-mAP 0.590.3Unverified
2SiAFrame-mAP 0.588.5Unverified
3YOWO + LFBFrame-mAP 0.587.3Unverified
4HITFrame-mAP 0.584.8Unverified
5HISAN (ResNet-101 + FPN)Video-mAP 0.282.3Unverified
6YOWOFrame-mAP 0.580.4Unverified
7Two-in-one Two StreamVideo-mAP 0.278.48Unverified
8MOCFrame-mAP 0.577.8Unverified
9Faster-RCNN + two-stream I3D convFrame-mAP 0.576.3Unverified
10Two-in-oneVideo-mAP 0.275.48Unverified
#ModelMetricClaimedVerifiedStatus
1SiAFrame-mAP 0.588.5Unverified
2HISAN (ResNet-101 + FPN)Video-mAP 0.287.59Unverified
3HITFrame-mAP 0.583.8Unverified
4HISAN (VGG-16)Frame-mAP 0.576.72Unverified
5DTSVideo-mAP 0.276.1Unverified
6YOWO + LFBFrame-mAP 0.575.7Unverified
7Two-in-one Two StreamVideo-mAP 0.574.74Unverified
8YOWOFrame-mAP 0.574.4Unverified
9MOCFrame-mAP 0.574Unverified
10Faster-RCNN + two-stream I3D convFrame-mAP 0.573.3Unverified
#ModelMetricClaimedVerifiedStatus
1TTMmAP28.79Unverified
2CTRNmAP27.8Unverified
3Coarse-Fine Networks (w/ self-supervised detection pretraining)mAP26.95Unverified
4UniMD+Sync. (RGB+Flow)mAP26.53Unverified
5PDAN (RGB+Flow)mAP26.5Unverified
6PATmAP26.5Unverified
7MS-TCT (RGB only)mAP25.4Unverified
83D ResNet-50 + super-events pretrained on AViDmAP25.2Unverified
9Coarse-Fine NetworksmAP25.1Unverified
10MLAD (RGB + Flow)mAP23.7Unverified
#ModelMetricClaimedVerifiedStatus
1MLADmAP51.5Unverified
2CTRNmAP51.2Unverified
3PDANmAP47.6Unverified
4TGMmAP46.4Unverified
5MS-TCT (RGB only)mAP43.1Unverified
6I3D + our super-eventmAP36.4Unverified
7Two-stream + LSTMmAP28.1Unverified
8Two-streammAP27.6Unverified
#ModelMetricClaimedVerifiedStatus
1Two-in-one Two StreamVideo-mAP 0.596.52Unverified
2DTSVideo-mAP 0.294.3Unverified
3Two-in-oneVideo-mAP 0.592.74Unverified
4T-CNNFrame-mAP 0.586.7Unverified
5MR-TS R-CNNFrame-mAP 0.584.52Unverified
6TS R-CNNFrame-mAP 0.582.3Unverified
7Action TubesFrame-mAP 0.568.1Unverified
#ModelMetricClaimedVerifiedStatus
1MAT (Ours) TransmAP71.6Unverified
2TadML-two streammAP59.7Unverified
3MAT (ours)mAP58.2Unverified
4TadML-rgbmAP53.46Unverified
#ModelMetricClaimedVerifiedStatus
1HITFrame-mAP 0.533.3Unverified
2SiAFrame-mAP 0.528.8Unverified
#ModelMetricClaimedVerifiedStatus
1MS-TCTFrame-mAP33.7Unverified
2PDANFrame-mAP32.7Unverified
#ModelMetricClaimedVerifiedStatus
1STCNNIoU0.14Unverified
2Two Stream NetworkIoU0.07Unverified
#ModelMetricClaimedVerifiedStatus
1STCNN-V2 (Vote decision)IoU0.52Unverified
2RGB and PRGBIoU0.35Unverified
#ModelMetricClaimedVerifiedStatus
1PATmAP44.6Unverified