Action Detection

Action detection aims to find both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets: action bounding boxes linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify the action and typically assumes a trimmed video.
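To make the notion of a tubelet concrete, the sketch below links per-frame detections into tubelets by greedily matching each box to a same-label tubelet that ended on the previous frame. This is only an illustration under stated assumptions: the names `Tubelet`, `box_iou`, and `link_detections`, the `(x1, y1, x2, y2)` box format, and the 0.5 linking threshold are all hypothetical choices and are not taken from any method listed on this page.

```python
from dataclasses import dataclass, field

# Assumed box format: (x1, y1, x2, y2) in pixels.
Box = tuple[float, float, float, float]


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


@dataclass
class Tubelet:
    """One action instance: a label plus one box per frame it spans."""
    label: str
    boxes: dict[int, Box] = field(default_factory=dict)  # frame index -> box

    @property
    def start(self) -> int:
        return min(self.boxes)

    @property
    def end(self) -> int:
        return max(self.boxes)


def link_detections(frames, iou_threshold: float = 0.5) -> list[Tubelet]:
    """Greedily link per-frame (label, box) detections into tubelets.

    `frames` holds one list of detections per video frame. A detection
    extends the best-overlapping tubelet with the same label that ended on
    the previous frame; otherwise it starts a new tubelet.
    """
    tubelets: list[Tubelet] = []
    for t, detections in enumerate(frames):
        for label, box in detections:
            best, best_iou = None, iou_threshold
            for tube in tubelets:
                # Tubelets already extended (or created) at frame t have
                # end == t, so they are skipped here automatically.
                if tube.label != label or tube.end != t - 1:
                    continue
                iou = box_iou(tube.boxes[t - 1], box)
                if iou >= best_iou:
                    best, best_iou = tube, iou
            if best is None:
                best = Tubelet(label)
                tubelets.append(best)
            best.boxes[t] = box
    return tubelets


# Toy input: the actor moves slightly between frames 0 and 1, frame 2 has
# no detection, and a far-away box appears on frame 3.
frames = [
    [("run", (0, 0, 10, 10))],
    [("run", (1, 0, 11, 10))],
    [],
    [("run", (50, 50, 60, 60))],
]
tubes = link_detections(frames)
print([(t.label, t.start, t.end) for t in tubes])
# prints [('run', 0, 1), ('run', 3, 3)]
```

In this toy setup, each tubelet answers all three questions at once: its boxes say where, its `start` and `end` frames say when, and its label says what. Temporal localization would keep only the frame range, and action recognition only the label.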

Papers

Showing 276–300 of 817 papers

Title	Date	Tasks	Status
Accelerating Coordinate Descent via Active Set Selection for Device Activity Detection for Multi-Cell Massive Random Access	Apr 27, 2021	Action Detection, Activity Detection	Unverified
COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis	Mar 7, 2019	Action Detection	Unverified
Anomalous Event Recognition in Videos Based on Joint Learning of Motion and Appearance with Multiple Ranking Measures	Feb 2, 2021	Action Detection, Activity Detection	Unverified
Actor-Centric Relation Network	Jul 28, 2018	Action Classification, Action Detection	Unverified
CLIP-VAD: Exploiting Vision-Language Models for Voice Activity Detection	Oct 18, 2024	Action Detection, Activity Detection	Unverified
Class Semantics-based Attention for Action Detection	Sep 6, 2021	Action Detection, Action Localization	Unverified
AnimalFormer: Multimodal Vision Framework for Behavior-based Precision Livestock Farming	Jun 14, 2024	Action Detection, Activity Detection	Unverified
A Boosting Algorithm for Positive-Unlabeled Learning	May 19, 2022	Action Detection, Activity Detection	Unverified
Classification Matters: Improving Video Action Detection with Class-Specific Attention	Jul 29, 2024	Action Detection, Classification	Unverified
Self-supervised New Activity Detection in Sensor-based Smart Environments	Jan 17, 2024	Action Detection, Activity Detection	Unverified
A new network-based algorithm for human activity recognition in video	Feb 21, 2015	Action Detection, Activity Detection	Unverified
An Ensemble SVM-based Approach for Voice Activity Detection	Feb 5, 2019	Action Detection, Activity Detection	Unverified
ACT-Net: Anchor-context Action Detection in Surgery Videos	Oct 5, 2023	Action Detection, Denoising	Unverified
GTTS-EHU Systems for QUESST at MediaEval 2014	Oct 16, 2014	Action Detection, Activity Detection	Unverified
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection	Feb 13, 2024	Action Detection, Activity Detection	Unverified
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization	Aug 19, 2020	Action Detection, Action Localization	Unverified
An enhanced system for the detection and active cancellation of snoring signals	Jul 31, 2023	Action Detection, Activity Detection	Unverified
An end-to-end (deep) neural network applied to raw EEG, fNIRs and body motion data for data fusion and BCI classification task without any pre-/post-processing	Jul 17, 2019	Action Detection, Activity Recognition	Unverified
Activity Recognition with Moving Cameras and Few Training Examples: Applications for Detection of Autism-Related Headbanging	Jan 10, 2021	Action Detection, Activity Detection	Unverified
3rd party observer gaze as a continuous measure of dialogue flow	May 1, 2012	Action Detection	Unverified
CBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency Assessment	Jun 25, 2025	Action Detection, Activity Detection	Unverified
Cascaded Boundary Regression for Temporal Action Detection	May 2, 2017	Action Detection, Regression	Unverified
CADDI: An in-Class Activity Detection Dataset using IMU data from low-cost sensors	Mar 4, 2025	Action Detection, Activity Detection	Unverified
An End-to-end 3D Convolutional Neural Network for Action Detection and Segmentation in Videos	Nov 30, 2017	Action Detection, Action Segmentation	Unverified
Group Event Detection with a Varying Number of Group Members for Video Surveillance	Feb 28, 2015	Action Detection, Activity Detection	Unverified

Datasets: UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results

UCF101-24
#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

J-HMDB
#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

Charades
#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	I3D + biGRU + VS-ST-MPNN	mAP	23.7	—	Unverified

Multi-THUMOS
#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

UCF Sports
#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

THUMOS'14
#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

MultiSports
#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

TSU
#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

TTStroke-21 ME21
#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

TTStroke-21 ME22
#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

MultiTHUMOS
#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified