Action Detection

Action detection aims to find both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets: action bounding boxes linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify the action and typically assumes a trimmed video.
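To make the idea of a tubelet concrete, the following is a minimal sketch of linking per-frame boxes across time by greedy IoU matching. It is an illustration under stated assumptions, not the linking procedure of any method listed on this page: the names `box_iou` and `link_tubelet`, the `(x1, y1, x2, y2)` box format, and the `min_iou` threshold of 0.3 are all hypothetical choices.

```python
from typing import List, Tuple

# Assumed box format: (x1, y1, x2, y2) in pixel coordinates.
Box = Tuple[float, float, float, float]


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def link_tubelet(
    frames: List[List[Box]], start: Box, min_iou: float = 0.3
) -> List[Tuple[int, Box]]:
    """Greedily extend a tubelet from `start` (a box in frame 0).

    In each later frame, pick the detection that overlaps most with the
    tubelet's latest box; stop when no detection reaches `min_iou`.
    Returns a list of (frame_index, box) pairs.
    """
    tube = [(0, start)]
    current = start
    for t in range(1, len(frames)):
        if not frames[t]:
            break  # no detections in this frame: the tubelet ends
        best = max(frames[t], key=lambda b: box_iou(current, b))
        if box_iou(current, best) < min_iou:
            break  # best candidate overlaps too little: the tubelet ends
        tube.append((t, best))
        current = best
    return tube
```

The first and last frame indices of the returned tubelet give its temporal extent (the "when"), while the boxes give its spatial extent (the "where"). Published methods generally also use class scores when linking; this sketch deliberately ignores them.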

Papers

Showing 651–675 of 817 papers

Title	Date	Tasks	Status
Recursive Binary Neural Network Learning Model with 2-bit/weight Storage Requirement	Jan 1, 2018	Action Detection, Activity Detection	Unverified
Reformulating Zero-shot Action Recognition for Multi-label Actions	Dec 1, 2021	Action Classification, Action Detection	Unverified
Relation Modeling in Spatio-Temporal Action Localization	Jun 15, 2021	Action Detection, Action Localization	Unverified
Review on Action Recognition for Accident Detection in Smart City Transportation Systems	Aug 20, 2022	Action Detection, Action Recognition	Unverified
Revisiting Few-shot Activity Detection with Class Similarity Control	Mar 31, 2020	Action Detection, Activity Detection	Unverified
RIS Assisted Device Activity Detection with Statistical Channel State Information	Jun 14, 2022	Action Detection, Activity Detection	Unverified
Risk Analysis and Prevention: LELIE, a Tool dedicated to Procedure and Requirement Authoring	May 1, 2012	Action Detection	Unverified
Zero-Shot Imitating Collaborative Manipulation Plans from YouTube Cooking Videos	Nov 25, 2019	Action Detection	Unverified
Robust Activity Detection for Massive Random Access	May 21, 2025	Action Detection, Activity Detection	Unverified
Robust Learning-Based Sparse Recovery for Device Activity Detection in Grant-Free Random Access Cell-Free Massive MIMO: Enhancing Resilience to Impairments	Mar 13, 2025	Action Detection, Activity Detection	Unverified
Robust Two-Stream Multi-Feature Network for Driver Drowsiness Detection	Oct 13, 2020	Action Detection, image-classification	Unverified
SALAD: Self-Assessment Learning for Action Detection	Nov 13, 2020	Action Detection, Action Localization	Unverified
SCC: Semantic Context Cascade for Efficient Action Detection	Jul 1, 2017	Action Detection	Unverified
SegCodeNet: Color-Coded Segmentation Masks for Activity Detection from Wearable Cameras	Aug 19, 2020	Action Detection, Activity Detection	Unverified
Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection	Nov 19, 2018	Action Detection, Multiple Action Detection	Unverified
SegTAD: Precise Temporal Action Detection via Semantic Segmentation	Mar 3, 2022	Action Detection, object-detection	Unverified
Self-Adaptive Soft Voice Activity Detection using Deep Neural Networks for Robust Speaker Verification	Sep 26, 2019	Action Detection, Activity Detection	Unverified
Self-Denoising Neural Networks for Few Shot Learning	Oct 26, 2021	Action Detection, Denoising	Unverified
Self-Feedback DETR for Temporal Action Detection	Aug 21, 2023	Action Detection, Decoder	Unverified
Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions	Dec 27, 2023	Action Detection, Activity Detection	Unverified
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction	May 21, 2023	Action Detection, Activity Detection	Unverified
Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech	Apr 8, 2020	Acoustic Modelling, Action Detection	Unverified
Semi-supervised Acoustic Modelling for Five-lingual Code-switched ASR using Automatically-segmented Soap Opera Speech	May 1, 2020	Acoustic Modelling, Action Detection	Unverified
Sensing Framework Design and Performance Optimization with Action Detection for ISCC	May 5, 2025	Action Detection	Unverified
Sequence Block based Compressed Sensing Multiuser Detection for 5G	Sep 28, 2018	Action Detection, Activity Detection	Unverified

Datasets, in the order of the benchmark tables below: UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified