Action Detection

Action detection aims to find both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets: action bounding boxes linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.
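To make the idea of a tubelet concrete, the sketch below stores one action label plus one bounding box per frame and compares two tubelets with a spatio-temporal IoU. The `Tubelet` class and the `box_iou` and `tube_iou` helpers are hypothetical names introduced only for illustration. The overlap measure (temporal IoU of the frame sets multiplied by the mean box IoU over shared frames) is one common definition and is not necessarily the one used by any benchmark listed on this page.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), assumed pixel coordinates


@dataclass
class Tubelet:
    """Hypothetical container: an action label plus one box per frame index."""
    label: str              # what the action is (the recognition part)
    boxes: Dict[int, Box]   # frame index -> box (the "where" part)

    @property
    def start(self) -> int:
        # First frame of the action (the "when" part).
        return min(self.boxes)

    @property
    def end(self) -> int:
        # Last frame of the action.
        return max(self.boxes)


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def tube_iou(p: Tubelet, g: Tubelet) -> float:
    """Temporal IoU of the frame sets times mean box IoU over shared frames.

    Labels are not compared here; a caller would check them separately.
    """
    shared = p.boxes.keys() & g.boxes.keys()
    if not shared:
        return 0.0
    all_frames = p.boxes.keys() | g.boxes.keys()
    spatial = sum(box_iou(p.boxes[f], g.boxes[f]) for f in shared) / len(shared)
    temporal = len(shared) / len(all_frames)
    return spatial * temporal


# Toy example: identical boxes, but the two tubelets only share frames 5 to 9.
pred = Tubelet("run", {f: (0.0, 0.0, 10.0, 10.0) for f in range(0, 10)})
gt = Tubelet("run", {f: (0.0, 0.0, 10.0, 10.0) for f in range(5, 15)})
print(pred.start, pred.end, tube_iou(pred, gt))
```

In this toy case the boxes match perfectly on the five shared frames, so the score is driven entirely by the temporal overlap (5 shared frames out of 15 in total). Temporal localization would only need `start` and `end`, and action recognition would only need `label`.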

Papers

Showing 401–425 of 817 papers

Title	Date	Tasks	Status
Sequence Block based Compressed Sensing Multiuser Detection for 5G	Sep 28, 2018	Action Detection, Activity Detection	Unverified
Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation	Nov 21, 2024	Action Detection, Activity Detection	Unverified
Siamese Neural Networks for Class Activity Detection	May 15, 2020	Action Detection, Activity Detection	Unverified
Signed Latent Factors for Spamming Activity Detection	Sep 28, 2022	Action Detection, Activity Detection	Unverified
Similarity R-C3D for Few-shot Temporal Activity Detection	Dec 25, 2018	Action Detection, Activity Detection	Unverified
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios	Jun 17, 2022	Action Detection, Activity Detection	Unverified
Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments	Jan 7, 2024	Action Detection, Activity Detection	Unverified
Skeleton Boxes: Solving skeleton based action detection with a single deep convolutional neural network	Apr 19, 2017	Action Detection, Action Recognition	Unverified
SkeleTR: Towards Skeleton-based Action Recognition in the Wild	Jan 1, 2023	Action Classification, Action Detection	Unverified
SkeleTR: Towards Skeleton-based Action Recognition in the Wild	Sep 20, 2023	Action Classification, Action Detection	Unverified
Smart Black Box 2.0: Efficient High-bandwidth Driving Data Collection based on Video Anomalies	Jan 3, 2021	Action Detection, Anomaly Detection	Unverified
Sparse Activity Discovery in Energy Constrained Multi-Cluster IoT Networks Using Group Testing	Mar 30, 2021	Action Detection, Activity Detection	Unverified
Sparse Signal Processing for Massive Connectivity via Mixed-Integer Programming	Aug 20, 2021	Action Detection, Activity Detection	Unverified
Spatial Correlation Aware Compressed Sensing for User Activity Detection and Channel Estimation in Massive MTC	Apr 17, 2021	Action Detection, Activity Detection	Unverified
Spatial Morphing Kernel Regression For Feature Interpolation	Feb 21, 2018	Action Detection, Activity Detection	Unverified
Spatial-Temporal Alignment Network for Action Recognition and Detection	Dec 4, 2020	Action Detection, Action Recognition	Unverified
Spatio-Temporal Action Detection with Cascade Proposal and Location Anticipation	Jul 31, 2017	Action Detection, Region Proposal	Unverified
Spatio-Temporal Action Detection with Multi-Object Interaction	Apr 1, 2020	Action Detection, Human Detection	Unverified
Spatio-Temporal Action Localization in a Weakly Supervised Setting	May 6, 2019	Action Detection, Action Localization	Unverified
Spatio-temporal Action Recognition: A Survey	Jan 27, 2019	Action Detection, Action Localization	Unverified
Spatio-Temporal Context for Action Detection	Jun 29, 2021	Action Detection, Video Understanding	Unverified
Spatio-Temporal Context Prompting for Zero-Shot Action Detection	Aug 28, 2024	Action Detection, Zero-Shot Action Detection	Unverified
Spatiotemporal Deformable Scene Graphs for Complex Activity Detection	Apr 16, 2021	Action Detection, Activity Detection	Unverified
Spatiotemporal Deformable Part Models for Action Detection	Jun 1, 2013	Action Detection, Object Detection	Unverified
Spatiotemporal Event Graphs for Dynamic Scene Understanding	Dec 11, 2023	Action Detection, Activity Detection	Unverified

Datasets: UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results

UCF101-24
#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

J-HMDB
#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

Charades
#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	I3D + biGRU + VS-ST-MPNN	mAP	23.7	—	Unverified

Multi-THUMOS
#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

UCF Sports
#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

THUMOS'14
#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

MultiSports
#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

TSU
#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

TTStroke-21 ME21
#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

TTStroke-21 ME22
#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

MultiTHUMOS
#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified