Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 817 papers

Title	Date	Tasks	Status	Hype
Hardware Accelerator and Neural Network Co-Optimization for Ultra-Low-Power Audio Processing Devices	Sep 8, 2022	Action DetectionActivity Detection	—Unverified	0
Spatio-Temporal Action Detection Under Large Motion	Sep 6, 2022	Action Detection	CodeCode Available	0
A Circular Window-based Cascade Transformer for Online Action Detection	Aug 30, 2022	Action DetectionAction Segmentation	—Unverified	0
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization	Aug 27, 2022	Action DetectionActivity Detection	—Unverified	0
Actor-identified Spatiotemporal Action Detection --- Detecting Who Is Doing What in Videos	Aug 27, 2022	Action ClassificationAction Detection	CodeCode Available	0
Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream	Aug 25, 2022	Action DetectionAction Localization	—Unverified	0
Review on Action Recognition for Accident Detection in Smart City Transportation Systems	Aug 20, 2022	Action DetectionAction Recognition	—Unverified	0
Weakly Supervised Online Action Detection for Infant General Movements	Aug 7, 2022	Action DetectionClassification	CodeCode Available	0
P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos	Jul 26, 2022	Action DetectionAction Localization	—Unverified	0
Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art Evaluation	Jul 26, 2022	Action DetectionDescriptive	—Unverified	0
Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions	Jul 24, 2022	Action DetectionAction Understanding	CodeCode Available	1
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection	Jul 21, 2022	Action DetectionVideo Understanding	—Unverified	0
Spotting Temporally Precise, Fine-Grained Events in Video	Jul 20, 2022	Action DetectionAction Spotting	CodeCode Available	1
Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning	Jul 20, 2022	Action DetectionAction Recognition	CodeCode Available	1
Zero-Shot Temporal Action Detection via Vision-Language Prompting	Jul 17, 2022	Action DetectionClassification	CodeCode Available	1
Semi-Supervised Temporal Action Detection with Proposal-Free Masking	Jul 14, 2022	Action DetectionGeneral Classification	CodeCode Available	1
ReAct: Temporal Action Detection with Relational Queries	Jul 14, 2022	Action ClassificationAction Detection	CodeCode Available	1
Proposal-Free Temporal Action Detection via Global Segmentation Mask Learning	Jul 14, 2022	Action DetectionRepresentation Learning	CodeCode Available	1
Online Target Speaker Voice Activity Detection for Speaker Diarization	Jul 13, 2022	Action DetectionActivity Detection	—Unverified	0
MM-ALT: A Multimodal Automatic Lyric Transcription System	Jul 13, 2022	Action DetectionActivity Detection	CodeCode Available	1
A semi-supervised methodology for fishing activity detection using the geometry behind the trajectory of multiple vessels	Jul 12, 2022	Action DetectionActivity Detection	CodeCode Available	1
Fine-grained Activities of People Worldwide	Jul 11, 2022	Action DetectionActivity Detection	—Unverified	0
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription	Jul 8, 2022	Action DetectionActivity Detection	—Unverified	0
Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic Delay	Jul 4, 2022	Action DetectionActivity Detection	CodeCode Available	0
An AIoT-enabled Autonomous Dementia Monitoring System	Jul 2, 2022	Action DetectionActivity Detection	—Unverified	0
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering	Jun 27, 2022	Action DetectionActivity Detection	CodeCode Available	1
Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation	Jun 21, 2022	Action DetectionTemporal Action Proposal Generation	CodeCode Available	0
One-stage Action Detection Transformer	Jun 21, 2022	Action Detection	—Unverified	0
Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection	Jun 20, 2022	Action DetectionActivity Detection	—Unverified	0
Context-aware Proposal Network for Temporal Action Detection	Jun 18, 2022	Action ClassificationAction Detection	—Unverified	0
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios	Jun 17, 2022	Action DetectionActivity Detection	—Unverified	0
RIS Assisted Device Activity Detection with Statistical Channel State Information	Jun 14, 2022	Action DetectionActivity Detection	—Unverified	0
GateHUB: Gated History Unit with Background Suppression for Online Action Detection	Jun 9, 2022	Action DetectionOnline Action Detection	—Unverified	0
A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector	Jun 7, 2022	Action ClassificationAction Detection	CodeCode Available	1
TadML: A fast temporal action detection with Mechanics-MLP	Jun 7, 2022	Action DetectionOptical Flow Estimation	CodeCode Available	0
Stargazer: A transformer-based driver action detection system for intelligent transportation	Jun 1, 2022	Action DetectionAction Recognition	CodeCode Available	1
Data-aided Active User Detection with a User Activity Extraction Network for Grant-free SCMA Systems	May 22, 2022	Action DetectionActivity Detection	—Unverified	0
Structured Attention Composition for Temporal Action Localization	May 20, 2022	Action DetectionAction Localization	CodeCode Available	2
A Boosting Algorithm for Positive-Unlabeled Learning	May 19, 2022	Action DetectionActivity Detection	—Unverified	0
Double-Sided Information Aided Temporal-Correlated Massive Access	May 16, 2022	Action DetectionActivity Detection	—Unverified	0
ETAD: Training Action Detection End to End on a Laptop	May 14, 2022	Action DetectionGPU	CodeCode Available	1
Weakly-Supervised Action Detection Guided by Audio Narration	May 12, 2022	Action Detection	—Unverified	0
An Empirical Study on Activity Recognition in Long Surgical Videos	May 5, 2022	Action DetectionActivity Detection	—Unverified	0
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection	May 5, 2022	Action Detectionobject-detection	CodeCode Available	1
Ultra-sensitive Flexible Sponge-Sensor Array for Muscle Activities Detection and Human Limb Motion Recognition	Apr 30, 2022	Action DetectionActivity Detection	—Unverified	0
RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems	Apr 30, 2022	Action Detection	—Unverified	0
Estimation of Reliable Proposal Quality for Temporal Action Detection	Apr 25, 2022	Action Detection	CodeCode Available	0
ADA-VAD: Unpaired Adversarial Domain Adaptation for Noise-Robust Voice Activity Detection	Apr 22, 2022	Action DetectionActivity Detection	—Unverified	0
A Multi-Person Video Dataset Annotation Method of Spatio-Temporally Actions	Apr 21, 2022	Action DetectionVideo Understanding	CodeCode Available	1
Video Action Detection: Analysing Limitations and Challenges	Apr 17, 2022	Action DetectionVideo Action Detection	—Unverified	0

Show:10 25 50

← PrevPage 7 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	I3D + biGRU + VS-ST-MPNN	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified