Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 701–750 of 817 papers

Title	Date	Tasks	Status	Hype
SoccerNet: A Scalable Dataset for Action Spotting in Soccer Videos	Apr 12, 2018	Action ClassificationAction Detection	CodeCode Available	1
Fine-grained Activity Recognition in Baseball Videos	Apr 9, 2018	Action DetectionActivity Detection	CodeCode Available	0
Jointly Detecting and Separating Singing Voice: A Multi-Task Approach	Apr 5, 2018	Action DetectionActivity Detection	—Unverified	0
Learning to Anonymize Faces for Privacy Preserving Action Detection	Mar 30, 2018	Action DetectionPrivacy Preserving	CodeCode Available	0
C3PO: Database and Benchmark for Early-stage Malicious Activity Detection in 3D Printing	Mar 20, 2018	Action DetectionActivity Detection	—Unverified	0
Temporal Gaussian Mixture Layer for Videos	Mar 16, 2018	Action DetectionActivity Detection	CodeCode Available	0
Frequency domain TRINICON-based blind source separation method with multi-source activity detection for sparsely mixed signals	Feb 25, 2018	Action DetectionActivity Detection	—Unverified	0
Real-Time End-to-End Action Detection with Two-Stream Networks	Feb 23, 2018	Action DetectionAction Recognition	—Unverified	0
Spatial Morphing Kernel Regression For Feature Interpolation	Feb 21, 2018	Action DetectionActivity Detection	—Unverified	0
Online Detection of Action Start in Untrimmed, Streaming Videos	Feb 19, 2018	Action DetectionGenerative Adversarial Network	—Unverified	0
Structured Label Inference for Visual Understanding	Feb 18, 2018	Action DetectionGeneral Classification	CodeCode Available	0
A Convolutional Neural Network Smartphone App for Real-Time Voice Activity Detection	Feb 1, 2018	Action DetectionActivity Detection	CodeCode Available	0
Contextual Multi-Scale Region Convolutional 3D Network for Activity Detection	Jan 28, 2018	Action DetectionActivity Detection	—Unverified	0
Recursive Binary Neural Network Learning Model with 2-bit/weight Storage Requirement	Jan 1, 2018	Action DetectionActivity Detection	—Unverified	0
Overcomplete Frame Thresholding for Acoustic Scene Analysis	Dec 25, 2017	Action DetectionActivity Detection	—Unverified	0
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification	Dec 13, 2017	Action ClassificationAction Detection	CodeCode Available	0
Learning Latent Super-Events to Detect Multiple Activities in Videos	Dec 5, 2017	Action DetectionActivity Detection	CodeCode Available	0
Graph Distillation for Action Detection with Privileged Modalities	Nov 30, 2017	Action ClassificationAction Detection	CodeCode Available	0
An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos	Nov 30, 2017	Action DetectionAction Segmentation	—Unverified	0
Budget-Aware Activity Detection with A Recurrent Policy Network	Nov 30, 2017	Action DetectionActivity Detection	—Unverified	0
Single Shot Temporal Action Detection	Oct 17, 2017	Action DetectionGeneral Classification	CodeCode Available	0
Real-Time Action Detection in Video Surveillance using Sub-Action Descriptor with Multi-CNN	Oct 10, 2017	Action DetectionAction Recognition	CodeCode Available	0
Joint Learning of Object and Action Detectors	Oct 1, 2017	Action DetectionObject	—Unverified	0
TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal	Oct 1, 2017	Action Detectionregression	—Unverified	0
Protest Activity Detection and Perceived Violence Estimation from Social Media Images	Sep 18, 2017	Action DetectionActivity Detection	CodeCode Available	0
A Nonparametric Model for Multimodal Collaborative Activities Summarization	Sep 4, 2017	Action DetectionActivity Detection	—Unverified	0
A Simple Model for Improving the Performance of the Stanford Parser for Action Detection in Textual Instructions	Sep 1, 2017	Action DetectionPOS	—Unverified	0
Extensible Hierarchical Method of Detecting Interactive Actions for Video Understanding	Aug 11, 2017	Action DetectionAction Recognition	—Unverified	0
EUDAMU at SemEval-2017 Task 11: Action Ranking and Type Matching for End-User Development	Aug 1, 2017	Action Detection	—Unverified	0
Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation	Jul 31, 2017	Action DetectionRegion Proposal	—Unverified	0
SST: Single-Stream Temporal Action Proposals	Jul 1, 2017	Action DetectionTemporal Action Proposal Generation	CodeCode Available	0
SCC: Semantic Context Cascade for Efficient Action Detection	Jul 1, 2017	Action Detection	—Unverified	0
Budget-Aware Deep Semantic Video Segmentation	Jul 1, 2017	Action DetectionActivity Detection	—Unverified	0
A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning	Jun 22, 2017	Action DetectionPosition	CodeCode Available	0
Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection	Jun 9, 2017	Action Detection	—Unverified	0
Action Sets: Weakly Supervised Action Segmentation without Ordering Constraints	Jun 2, 2017	Action DetectionAction Segmentation	CodeCode Available	0
Polish Read Speech Corpus for Speech Tools and Services	Jun 1, 2017	Action DetectionActivity Detection	—Unverified	0
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions	May 23, 2017	Actin DetectionAction Detection	CodeCode Available	1
Am I Done? Predicting Action Progress in Videos	May 4, 2017	Action DetectionTemporal Localization	CodeCode Available	0
Cascaded Boundary Regression for Temporal Action Detection	May 2, 2017	Action Detectionregression	—Unverified	0
Skeleton-based Action Recognition with Convolutional Neural Networks	Apr 25, 2017	Action ClassificationAction Detection	CodeCode Available	1
Temporal Action Detection with Structured Segment Networks	Apr 20, 2017	Action DetectionAction Recognition	CodeCode Available	2
Skeleton Boxes: Solving skeleton based action detection with a single deep convolutional neural network	Apr 19, 2017	Action DetectionAction Recognition	—Unverified	0
AMTnet: Action-Micro-Tube Regression by End-to-end Trainable Deep Architecture	Apr 17, 2017	Action DetectionRegion Proposal	—Unverified	0
Temporal Action Localization by Structured Maximal Sums	Apr 15, 2017	Action DetectionAction Localization	—Unverified	0
Predictive-Corrective Networks for Action Detection	Apr 12, 2017	Action DetectionOptical Flow Estimation	—Unverified	0
Incremental Tube Construction for Human Action Detection	Apr 5, 2017	Action Detection	CodeCode Available	0
Unsupervised Action Proposal Ranking through Proposal Recombination	Apr 3, 2017	Action DetectionAction Recognition	—Unverified	0
Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos	Mar 30, 2017	Action Detectionimage-classification	CodeCode Available	0
PKU-MMD: A Large Scale Benchmark for Continuous Multi-Modal Human Action Understanding	Mar 22, 2017	Action DetectionAction Recognition	—Unverified	0

Show:10 25 50

← PrevPage 15 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified