Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 751–800 of 817 papers

Title	Date	Tasks	Status
Untrimmed Video Classification for Activity Detection: submission to ActivityNet Challenge	Jul 7, 2016	Action DetectionActivity Detection	CodeCode Available
When do they StOP?: A First Step Towards Automatically Identifying Team Communication in the Operating Room	Feb 12, 2025	Action DetectionActivity Detection	CodeCode Available
Dual Attention in Time and Frequency Domain for Voice Activity Detection	Mar 27, 2020	Action DetectionActivity Detection	CodeCode Available
Pre-Equalization Aided Grant-Free Massive Access in Massive MIMO System	Feb 10, 2025	Action DetectionActivity Detection	CodeCode Available
Incremental Tube Construction for Human Action Detection	Apr 5, 2017	Action Detection	CodeCode Available
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information	Nov 28, 2021	Action DetectionActivity Detection	CodeCode Available
Progression-Guided Temporal Action Detection in Videos	Aug 18, 2023	Action ClassificationAction Detection	CodeCode Available
ACDnet: An action detection network for real-time edge computing based on flow-guided feature approximation and memory aggregation	Feb 26, 2021	Action DetectionEdge-computing	CodeCode Available
A flexible model for training action localization with varying levels of supervision	Jun 29, 2018	Action DetectionAction Localization	CodeCode Available
A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning	Jun 22, 2017	Action DetectionPosition	CodeCode Available
A Pursuit of Temporal Accuracy in General Activity Detection	Mar 8, 2017	Action DetectionActivity Detection	CodeCode Available
Identifying Visible Actions in Lifestyle Vlogs	Jun 10, 2019	Action Detection	CodeCode Available
Protest Activity Detection and Perceived Violence Estimation from Social Media Images	Sep 18, 2017	Action DetectionActivity Detection	CodeCode Available
The Instantaneous Accuracy: a Novel Metric for the Problem of Online Human Behaviour Recognition in Untrimmed Videos	Mar 22, 2020	Action DetectionOnline Action Detection	CodeCode Available
Human Activity Recognition: A Spatio-temporal Image Encoding of 3D Skeleton Data for Online Action Detection	Feb 8, 2020	Action DetectionActivity Recognition	CodeCode Available
Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation	Jun 21, 2022	Action DetectionTemporal Action Proposal Generation	CodeCode Available
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition	Dec 11, 2019	Action ClassificationAction Detection	CodeCode Available
Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021	Dec 16, 2021	Action DetectionFine-Grained Action Detection	CodeCode Available
RALACs: Action Recognition in Autonomous Vehicles using Interaction Encoding and Optical Flow	Sep 28, 2022	Action ClassificationAction Detection	CodeCode Available
Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022	Jan 31, 2023	Action DetectionBenchmarking	CodeCode Available
Discovering Multi-Label Actor-Action Association in a Weakly Supervised Setting	Jan 21, 2021	Action DetectionMulti-Label Learning	CodeCode Available
Decoupling Localization and Classification in Single Shot Temporal Action Detection	Apr 16, 2019	Action DetectionClassification	CodeCode Available
R-C3D: Region Convolutional 3D Network for Temporal Activity Detection	Mar 22, 2017	Action DetectionAction Recognition In Videos	CodeCode Available
Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos	Mar 30, 2017	Action Detectionimage-classification	CodeCode Available
A Multi-Task Learning Approach for Human Activity Segmentation and Ergonomics Risk Assessment	Aug 7, 2020	Action DetectionDecoder	CodeCode Available
Handwashing Action Detection System for an Autonomous Social Robot	Oct 27, 2022	Action DetectionAction Recognition	CodeCode Available
Real-Time Action Detection in Video Surveillance using Sub-Action Descriptor with Multi-CNN	Oct 10, 2017	Action DetectionAction Recognition	CodeCode Available
SST: Single-Stream Temporal Action Proposals	Jul 1, 2017	Action DetectionTemporal Action Proposal Generation	CodeCode Available
Stable Mean Teacher for Semi-supervised Video Action Detection	Dec 10, 2024	Action DetectionSemantic Segmentation	CodeCode Available
Graph Distillation for Action Detection with Privileged Modalities	Nov 30, 2017	Action ClassificationAction Detection	CodeCode Available
Video action detection by learning graph-based spatio-temporal interactions	Dec 9, 2019	Action DetectionAction Localization	CodeCode Available
Dance with Flow: Two-in-One Stream Action Detection	Apr 1, 2019	Action DetectionOptical Flow Estimation	CodeCode Available
Two-Stream AMTnet for Action Detection	Apr 3, 2020	Action DetectionAutonomous Driving	CodeCode Available
A Convolutional Neural Network Smartphone App for Real-Time Voice Activity Detection	Feb 1, 2018	Action DetectionActivity Detection	CodeCode Available
Refining Action Boundaries for One-stage Detection	Oct 25, 2022	Action Detection	CodeCode Available
STEP: Spatio-Temporal Progressive Learning for Video Action Detection	Apr 19, 2019	Action DetectionAction Recognition	CodeCode Available
CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection	Mar 28, 2023	Action DetectionAction Recognition	CodeCode Available
Gan-Based Joint Activity Detection and Channel Estimation For Grant-free Random Access	Apr 4, 2022	Action DetectionActivity Detection	CodeCode Available
Fine-Grained Classroom Activity Detection from Audio with Neural Networks	Jul 29, 2021	Action DetectionActivity Detection	CodeCode Available
RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns	Aug 21, 2020	Action DetectionActivity Detection	CodeCode Available
Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol	Mar 26, 2020	Action DetectionOnline Action Detection	CodeCode Available
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification	Dec 13, 2017	Action ClassificationAction Detection	CodeCode Available
Review of Action Recognition and Detection Methods	Oct 21, 2016	Action DetectionAction Recognition	CodeCode Available
Actor Conditioned Attention Maps for Video Action Detection	Dec 30, 2018	Action DetectionVideo Action Detection	CodeCode Available
Contextual Explainable Video Representation: Human Perception-based Understanding	Dec 12, 2022	Action DetectionAction Recognition	CodeCode Available
Fine-grained Activity Recognition in Baseball Videos	Apr 9, 2018	Action DetectionActivity Detection	CodeCode Available
Fine-Grained Action Detection with RGB and Pose Information using Two Stream Convolutional Networks	Feb 6, 2023	Action ClassificationAction Detection	CodeCode Available
Structure-Aware Convolutional Neural Networks	Dec 1, 2018	Action DetectionAction Recognition	CodeCode Available
SoccerDB: A Large-Scale Database for Comprehensive Video Understanding	Dec 10, 2019	Action ClassificationAction Detection	CodeCode Available
Finding Action Tubes	Nov 21, 2014	Action Detectionobject-detection	CodeCode Available

Show:10 25 50

← PrevPage 16 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified