Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 601–650 of 817 papers

Title	Date	Tasks	Status
The Instantaneous Accuracy: a Novel Metric for the Problem of Online Human Behaviour Recognition in Untrimmed Videos	Mar 22, 2020	Action DetectionOnline Action Detection	CodeCode Available
Comprehensive Instructional Video Analysis: The COIN Dataset and Performance Evaluation	Mar 20, 2020	Action Detection	—Unverified
A Novel Online Action Detection Framework from Untrimmed Video Streams	Mar 17, 2020	Action DetectionAction Localization	—Unverified
ZSTAD: Zero-Shot Temporal Activity Detection	Mar 12, 2020	Action DetectionActivity Detection	—Unverified
Cross modal video representations for weakly supervised active speaker localization	Mar 9, 2020	Action DetectionActive Speaker Localization	—Unverified
Argus: Efficient Activity Detection System for Extended Video Analysis	Mar 2, 2020	Action DetectionActivity Detection	—Unverified
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team	Feb 23, 2020	Action DetectionActivity Detection	—Unverified
Back to the Future: Joint Aware Temporal Deep Learning 3D Human Pose Estimation	Feb 22, 2020	3D Human Pose EstimationAction Detection	CodeCode Available
Human Activity Recognition: A Spatio-temporal Image Encoding of 3D Skeleton Data for Online Action Detection	Feb 8, 2020	Action DetectionActivity Recognition	CodeCode Available
3D ResNet with Ranking Loss Function for Abnormal Activity Detection in Videos	Feb 4, 2020	Action DetectionAction Recognition	—Unverified
End-to-End Automatic Speech Recognition Integrated With CTC-Based Voice Activity Detection	Feb 3, 2020	Action DetectionActivity Detection	—Unverified
Faster Activity and Data Detection in Massive Random Access: A Multi-armed Bandit Approach	Jan 28, 2020	Action DetectionActivity Detection	—Unverified
A Comprehensive Study on Temporal Modeling for Online Action Detection	Jan 21, 2020	Action DetectionOnline Action Detection	CodeCode Available
Personalized Activity Recognition with Deep Triplet Embeddings	Jan 15, 2020	Action DetectionActivity Detection	CodeCode Available
End-Point Detection with State Transition Model based on Chunk-Wise Classification	Dec 22, 2019	Action DetectionActivity Detection	—Unverified
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition	Dec 11, 2019	Action ClassificationAction Detection	CodeCode Available
SoccerDB: A Large-Scale Database for Comprehensive Video Understanding	Dec 10, 2019	Action ClassificationAction Detection	CodeCode Available
Learning to Discriminate Information for Online Action Detection	Dec 10, 2019	Action DetectionOnline Action Detection	CodeCode Available
Video action detection by learning graph-based spatio-temporal interactions	Dec 9, 2019	Action DetectionAction Localization	CodeCode Available
DASZL: Dynamic Action Signatures for Zero-shot Learning	Dec 8, 2019	Action DetectionActivity Detection	—Unverified
SRG: Snippet Relatedness-based Temporal Action Proposal Generator	Nov 26, 2019	Action DetectionTemporal Action Proposal Generation	—Unverified
Zero-Shot Imitating Collaborative Manipulation Plans from YouTube Cooking Videos	Nov 25, 2019	Action Detection	—Unverified
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization	Nov 15, 2019	Actin DetectionAction Detection	CodeCode Available
Intelligent Reflecting Surface for Massive Device Connectivity: Joint Activity Detection and Channel Estimation	Nov 12, 2019	Action DetectionActivity Detection	—Unverified
A Proposed Artificial intelligence Model for Real-Time Human Action Localization and Tracking	Nov 9, 2019	Action DetectionAction Localization	—Unverified
The Speed Submission to DIHARD II: Contributions & Lessons Learned	Nov 6, 2019	Action DetectionActivity Detection	—Unverified
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding	Nov 1, 2019	Action DetectionAction Recognition	CodeCode Available
A Bin Encoding Training of a Spiking Neural Network-based Voice Activity Detection	Oct 28, 2019	Action DetectionActivity Detection	—Unverified
Spiking neural networks trained with backpropagation for low power neuromorphic implementation of voice activity detection	Oct 22, 2019	Action DetectionActivity Detection	—Unverified
Multimodal Learning For Classroom Activity Detection	Oct 22, 2019	Action DetectionActivity Detection	—Unverified
AFO-TAD: Anchor-free One-Stage Detector for Temporal Action Detection	Oct 18, 2019	Action Detectionobject-detection	—Unverified
Learning Temporal Action Proposals With Fewer Labels	Oct 3, 2019	Action DetectionSemi-Supervised Action Detection	—Unverified
Temporal Structure Mining for Weakly Supervised Action Detection	Oct 1, 2019	Action DetectionWeakly Supervised Action Localization	—Unverified
Hierarchical Self-Attention Network for Action Localization in Videos	Oct 1, 2019	Action DetectionAction Localization	—Unverified
Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification	Sep 26, 2019	Action DetectionActivity Detection	—Unverified
Computer-Aided Automated Detection of Gene-Controlled Social Actions of Drosophila	Sep 11, 2019	Action DetectionClassification	—Unverified
Multi-Stream Single Shot Spatial-Temporal Action Detection	Aug 22, 2019	Action DetectionOptical Flow Estimation	—Unverified
Multi-timescale Trajectory Prediction for Abnormal Human Activity Detection	Aug 12, 2019	Action DetectionActivity Detection	—Unverified
Personal VAD: Speaker-Conditioned Voice Activity Detection	Aug 12, 2019	Action DetectionActivity Detection	CodeCode Available
Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization	Aug 7, 2019	Action DetectionAction Localization	—Unverified
Multi-task Self-Supervised Learning for Human Activity Detection	Jul 27, 2019	Action DetectionActivity Detection	—Unverified
A Novel Approach for Robust Multi Human Action Recognition and Summarization based on 3D Convolutional Neural Networks	Jul 25, 2019	Action DetectionAction Recognition	—Unverified
Attention Filtering for Multi-person Spatiotemporal Action Detection on Deep Two-Stream CNN Architectures	Jul 21, 2019	Action DetectionGeneral Classification	—Unverified
An end-to-end (deep) neural network applied to raw EEG, fNIRs and body motion data for data fusion and BCI classification task without any pre-/post-processing	Jul 17, 2019	Action DetectionActivity Recognition	—Unverified
Deformable Tube Network for Action Detection in Videos	Jul 3, 2019	Action DetectionAction Recognition	—Unverified
An Acoustic Emission Activity Detection Method based on Short-Term Waveform Features: Application to Metallic Components under Uniaxial Tensile Test	Jun 26, 2019	Action DetectionActivity Detection	—Unverified
vireoJD-MM at Activity Detection in Extended Videos	Jun 20, 2019	Action DetectionAction Localization	—Unverified
The Second DIHARD Diarization Challenge: Dataset, task, and baselines	Jun 18, 2019	Action DetectionActivity Detection	CodeCode Available
Accelerating temporal action proposal generation via high performance computing	Jun 15, 2019	Action DetectionAction Recognition	—Unverified
Learning Spatio-Temporal Representation with Local and Global Diffusion	Jun 13, 2019	Action ClassificationAction Detection	CodeCode Available

Show:10 25 50

← PrevPage 13 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified