Action Detection

Action detection aims to find both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets: action bounding boxes linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.
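To make the idea of a tubelet concrete, here is a minimal sketch of linking per-frame boxes across time by greedy IoU matching. It is an illustration only, not the method of any paper listed on this page: the function names, the `(x1, y1, x2, y2)` box format, and the 0.5 linking threshold are all assumptions, and classification scores are left out for brevity.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x1, y1, x2, y2)


def box_iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def link_tubelets(frames: List[List[Box]], min_iou: float = 0.5) -> List[Dict[int, Box]]:
    """Greedily link per-frame boxes into tubelets (frame index -> box).

    A box extends the open tubelet whose previous-frame box overlaps it
    most (IoU >= min_iou); otherwise it starts a new tubelet.
    """
    tubelets: List[Dict[int, Box]] = []
    for t, boxes in enumerate(frames):
        # Only tubelets that have a box in the previous frame can be extended.
        open_ids = [i for i, tube in enumerate(tubelets) if (t - 1) in tube]
        used = set()
        for box in boxes:
            best_id, best_iou = None, min_iou
            for i in open_ids:
                if i in used:
                    continue
                iou = box_iou(tubelets[i][t - 1], box)
                if iou >= best_iou:
                    best_id, best_iou = i, iou
            if best_id is None:
                tubelets.append({t: box})
            else:
                tubelets[best_id][t] = box
                used.add(best_id)
    return tubelets


# Hypothetical detections for a 4-frame clip (frame 2 has no detections).
frames = [
    [(0.0, 0.0, 10.0, 10.0)],
    [(1.0, 0.0, 11.0, 10.0)],
    [],
    [(50.0, 50.0, 60.0, 60.0)],
]
tubes = link_tubelets(frames)
# Start and end frame of each tubelet, i.e. its temporal extent.
spans = [(min(tube), max(tube)) for tube in tubes]
```

In this toy clip the first two boxes overlap enough to form one tubelet covering frames 0 to 1, and the box in frame 3 starts a second tubelet. The `spans` list gives the start and end frame that temporal localization alone would report, while the tubelets also carry the spatial boxes.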

Papers


Showing 201–250 of 817 papers

Title	Date	Tasks	Status
Action Detection via an Image Diffusion Process	Apr 1, 2024	Action Detection, Image Generation	Unverified
Cross-domain Voice Activity Detection with Self-Supervised Representations	Sep 22, 2022	Action Detection, Activity Detection	Unverified
Cross modal video representations for weakly supervised active speaker localization	Mar 9, 2020	Action Detection, Active Speaker Localization	Unverified
Cross-modal Supervision for Learning Active Speaker Detection in Video	Mar 29, 2016	Action Detection, Active Speaker Detection	Unverified
CTRN: Class-Temporal Relational Network for Action Detection	Oct 26, 2021	Action Detection	Unverified
Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model	Jan 29, 2024	Action Detection, Action Localization	Unverified
Convolutional Neural Networks for Aerial Multi-Label Pedestrian Detection	Jul 16, 2018	Action Detection, Object	Unverified
Continuous Human Action Detection Based on Wearable Inertial Data	Dec 11, 2021	Action Detection, Gesture Recognition	Unverified
Data-aided Active User Detection with a User Activity Extraction Network for Grant-free SCMA Systems	May 22, 2022	Action Detection, Activity Detection	Unverified
Dataset for Real-World Human Action Detection Using FMCW mmWave Radar	Dec 23, 2024	Action Detection, Privacy Preserving	Unverified
A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos	Nov 20, 2018	Action Classification, Action Detection	Unverified
ADA-VAD: Unpaired Adversarial Domain Adaptation for Noise-Robust Voice Activity Detection	Apr 22, 2022	Action Detection, Activity Detection	Unverified
Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection	Mar 30, 2023	Action Detection, Action Localization	Unverified
Deconstruct Complexity (DeComplex): A Novel Perspective on Tackling Dense Action Detection	Jan 30, 2025	Action Detection, Contrastive Learning	Unverified
whu-nercms at trecvid2021:instance search task	Oct 30, 2021	Action Detection, Face Detection	Unverified
Deep Learning-Assisted Parallel Interference Cancellation for Grant-Free NOMA in Machine-Type Communication	Mar 12, 2024	Action Detection, Activity Detection	Unverified
Deep Learning-based Action Detection in Untrimmed Videos: A Survey	Sep 30, 2021	Action Detection, Action Recognition	Unverified
Deep learning-based approaches for human motion decoding in smart walkers for rehabilitation	Jan 13, 2023	Action Detection, Action Recognition	Unverified
Continual Low-Rank Scaled Dot-product Attention	Dec 4, 2024	Action Detection, Audio Classification	Unverified
Deep Learning for Asynchronous Massive Access with Data Frame Length Diversity	May 12, 2023	Action Detection, Activity Detection	Unverified
Deep Learning for Detecting Multiple Space-Time Action Tubes in Videos	Aug 4, 2016	Action Detection, Motion Detection	Unverified
Deep Learning for Encrypted Traffic Classification and Unknown Data Detection	Mar 25, 2022	Action Detection, Activity Detection	Unverified
Context Understanding in Computer Vision: A Survey	Feb 10, 2023	Action Detection, image-classification	Unverified
Detection of Object Throwing Behavior in Surveillance Videos	Mar 11, 2024	Action Detection, Anomaly Detection	Unverified
Device Activity Detection and Channel Estimation for Millimeter-Wave Massive MIMO	Feb 7, 2024	Action Detection, Activity Detection	Unverified
Device Detection and Channel Estimation in MTC with Correlated Activity Pattern	Oct 23, 2023	Action Detection, Activity Detection	Unverified
A processing framework to access large quantities of whispered speech found in ASMR	Mar 13, 2023	Action Detection, Activity Detection	Unverified
Contextual Multi-Scale Region Convolutional 3D Network for Activity Detection	Jan 28, 2018	Action Detection, Activity Detection	Unverified
Application of Machine Learning Techniques in Human Activity Recognition	Oct 19, 2015	Action Detection, Activity Detection	Unverified
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team	Feb 23, 2020	Action Detection, Activity Detection	Unverified
Access Delay Constrained Activity Detection in Massive Random Access	Nov 4, 2021	Action Detection, Activity Detection	Unverified
ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object Interactions in Industrial Scenarios	Sep 26, 2023	Action Detection, Human-Object Interaction Detection	Unverified
Distributed Activity Detection for Cell-Free Hybrid Near-Far Field Communications	Jun 17, 2025	Action Detection, Activity Detection	Unverified
Distributed Optimization for Massive Connectivity	Jun 10, 2020	Action Detection, Activity Detection	Unverified
DOAD: Decoupled One Stage Action Detection Network	Apr 1, 2023	Action Detection, Action Recognition	Unverified
Double-Sided Information Aided Temporal-Correlated Massive Access	May 16, 2022	Action Detection, Activity Detection	Unverified
DT4ECG: A Dual-Task Learning Framework for ECG-Based Human Identity Recognition and Human Activity Detection	Feb 16, 2025	Action Detection, Activity Detection	Unverified
Attention Filtering for Multi-person Spatiotemporal Action Detection on Deep Two-Stream CNN Architectures	Jul 21, 2019	Action Detection, General Classification	Unverified
Dual DETRs for Multi-Label Temporal Action Detection	Mar 31, 2024	Action Detection, object-detection	Unverified
ESAD: Endoscopic Surgeon Action Detection Dataset	Jun 12, 2020	Action Detection	Unverified
Fast Low-parameter Video Activity Localization in Collaborative Learning Environments	Mar 2, 2024	Action Detection, Activity Detection	Unverified
Context-LSTM: a robust classifier for video detection on UCF101	Mar 13, 2022	Action Detection, Action Recognition	Unverified
Application-Driven AI Paradigm for Hand-Held Action Detection	Oct 13, 2022	Action Detection, Object	Unverified
Early Detection of In-Memory Malicious Activity based on Run-time Environmental Features	Mar 30, 2021	Action Detection, Activity Detection	Unverified
Effective Abnormal Activity Detection on Multivariate Time Series Healthcare Data	Sep 11, 2023	Action Detection, Activity Detection	Unverified
Efficient Action Detection in Untrimmed Videos via Multi-Task Learning	Dec 22, 2016	Action Detection, Action Localization	Unverified
ContextDet: Temporal Action Detection with Adaptive Context Aggregation	Oct 20, 2024	Action Detection, Video Understanding	Unverified
A Customer Level Fraudulent Activity Detection Benchmark for Enhancing Machine Learning Model Research and Evaluation	Apr 23, 2024	Action Detection, Activity Detection	Unverified
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization	Jan 6, 2022	Action Detection, Active Speaker Detection	Unverified
Context-aware Proposal Network for Temporal Action Detection	Jun 18, 2022	Action Classification, Action Detection	Unverified


Datasets: UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results

UCF101-24

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

J-HMDB

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

Charades

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	I3D + biGRU + VS-ST-MPNN	mAP	23.7	—	Unverified

Multi-THUMOS

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

UCF Sports

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

THUMOS'14

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

MultiSports

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

TSU

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

TTStroke-21 ME21

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

TTStroke-21 ME22

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

MultiTHUMOS

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified
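
Several entries in the tables above report Video-mAP at an IoU threshold of 0.2 or 0.5. As a rough sketch of what that threshold applies to, the code below computes a spatio-temporal IoU between a predicted and a ground-truth tube, using one commonly used definition: temporal IoU over frame indices multiplied by the mean box IoU on the shared frames. The function names and the tube representation (a dict from frame index to an `(x1, y1, x2, y2)` box) are assumptions made for illustration, and the official evaluation scripts of each benchmark may differ in detail.

```python
from typing import Dict, Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x1, y1, x2, y2)
Tube = Dict[int, Box]                    # assumed format: frame index -> box


def box_iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def tube_iou(pred: Tube, gt: Tube) -> float:
    """Spatio-temporal IoU: temporal IoU times mean spatial IoU on shared frames."""
    shared = set(pred) & set(gt)
    if not shared:
        return 0.0
    temporal = len(shared) / len(set(pred) | set(gt))
    spatial = sum(box_iou(pred[t], gt[t]) for t in shared) / len(shared)
    return temporal * spatial


def is_match(pred: Tube, gt: Tube, threshold: float) -> bool:
    """A same-class prediction counts as correct if its tube IoU reaches the threshold."""
    return tube_iou(pred, gt) >= threshold


# Hypothetical tubes: identical boxes, but only one of three frames is shared.
box = (0.0, 0.0, 10.0, 10.0)
pred = {0: box, 1: box}
gt = {1: box, 2: box}
score = tube_iou(pred, gt)  # temporal IoU 1/3, spatial IoU 1.0
```

With these toy tubes the score is one third, so the prediction would count as a match at a threshold of 0.2 but not at 0.5. This is why Video-mAP 0.2 and Video-mAP 0.5 figures are not directly comparable, even within the same table.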