Action Detection

Action detection aims to determine both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets: action bounding boxes linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.
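
To make the idea of a tubelet concrete, the sketch below links per-frame boxes across time by greedy IoU matching. It is only an illustration under stated assumptions: the names `box_iou` and `link_tubelet`, the `(x1, y1, x2, y2)` box format, the `min_iou` threshold of 0.3, and the toy detections are all hypothetical, and the methods listed on this page generally use more elaborate linking.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x1, y1, x2, y2)


def box_iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def link_tubelet(frames: List[List[Box]], start_box: Box,
                 min_iou: float = 0.3) -> Dict[int, Box]:
    """Greedily extend a tubelet from frame 0.

    At each frame the box that overlaps most with the previous box is kept;
    linking stops when no box overlaps by at least min_iou.
    """
    tube = {0: start_box}
    current = start_box
    for t in range(1, len(frames)):
        if not frames[t]:
            break
        best = max(frames[t], key=lambda b: box_iou(current, b))
        if box_iou(current, best) < min_iou:
            break
        tube[t] = best
        current = best
    return tube


# Toy per-frame detections (hypothetical data): one actor drifting right,
# a distractor in frame 1, and the actor gone by frame 3.
detections = [
    [(10, 10, 50, 50)],
    [(12, 11, 52, 51), (200, 200, 240, 240)],
    [(14, 12, 54, 52)],
    [(300, 300, 340, 340)],
]
tube = link_tubelet(detections, detections[0][0])
start_frame, end_frame = min(tube), max(tube)  # temporal extent of the tubelet
```

The keys of `tube` give the temporal extent (the "when") and the values give the boxes (the "where"). A classifier applied to the tubelet would supply the "what".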

Papers


Showing 401–450 of 817 papers

Title	Date	Tasks	Status
Online Target Speaker Voice Activity Detection for Speaker Diarization	Jul 13, 2022	Action Detection, Activity Detection	Unverified
Fine-grained Activities of People Worldwide	Jul 11, 2022	Action Detection, Activity Detection	Unverified
Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription	Jul 8, 2022	Action Detection, Activity Detection	Unverified
Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic Delay	Jul 4, 2022	Action Detection, Activity Detection	Code Available
An AIoT-enabled Autonomous Dementia Monitoring System	Jul 2, 2022	Action Detection, Activity Detection	Unverified
Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation	Jun 21, 2022	Action Detection, Temporal Action Proposal Generation	Code Available
One-stage Action Detection Transformer	Jun 21, 2022	Action Detection	Unverified
Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection	Jun 20, 2022	Action Detection, Activity Detection	Unverified
Context-aware Proposal Network for Temporal Action Detection	Jun 18, 2022	Action Classification, Action Detection	Unverified
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios	Jun 17, 2022	Action Detection, Activity Detection	Unverified
RIS Assisted Device Activity Detection with Statistical Channel State Information	Jun 14, 2022	Action Detection, Activity Detection	Unverified
GateHUB: Gated History Unit with Background Suppression for Online Action Detection	Jun 9, 2022	Action Detection, Online Action Detection	Unverified
TadML: A fast temporal action detection with Mechanics-MLP	Jun 7, 2022	Action Detection, Optical Flow Estimation	Code Available
Data-aided Active User Detection with a User Activity Extraction Network for Grant-free SCMA Systems	May 22, 2022	Action Detection, Activity Detection	Unverified
A Boosting Algorithm for Positive-Unlabeled Learning	May 19, 2022	Action Detection, Activity Detection	Unverified
Double-Sided Information Aided Temporal-Correlated Massive Access	May 16, 2022	Action Detection, Activity Detection	Unverified
Weakly-Supervised Action Detection Guided by Audio Narration	May 12, 2022	Action Detection	Unverified
An Empirical Study on Activity Recognition in Long Surgical Videos	May 5, 2022	Action Detection, Activity Detection	Unverified
RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems	Apr 30, 2022	Action Detection	Unverified
Ultra-sensitive Flexible Sponge-Sensor Array for Muscle Activities Detection and Human Limb Motion Recognition	Apr 30, 2022	Action Detection, Activity Detection	Unverified
Estimation of Reliable Proposal Quality for Temporal Action Detection	Apr 25, 2022	Action Detection	Code Available
ADA-VAD: Unpaired Adversarial Domain Adaptation for Noise-Robust Voice Activity Detection	Apr 22, 2022	Action Detection, Activity Detection	Unverified
Video Action Detection: Analysing Limitations and Challenges	Apr 17, 2022	Action Detection, Video Action Detection	Unverified
Anomalous Sound Detection Based on Machine Activity Detection	Apr 15, 2022	Action Detection, Activity Detection	Unverified
Automated speech tools for helping communities process restricted-access corpora for language revival efforts	Apr 15, 2022	Action Detection, Activity Detection	Unverified
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition	Apr 8, 2022	Action Detection, Activity Detection	Unverified
Faster-TAD: Towards Temporal Action Detection with Proposal Generation and Classification in a Unified Network	Apr 6, 2022	Action Detection, Action Spotting	Unverified
Gan-Based Joint Activity Detection and Channel Estimation For Grant-free Random Access	Apr 4, 2022	Action Detection, Activity Detection	Code Available
Deep Learning for Encrypted Traffic Classification and Unknown Data Detection	Mar 25, 2022	Action Detection, Activity Detection	Unverified
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios	Mar 18, 2022	Action Detection, Activity Detection	Unverified
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation	Mar 16, 2022	Action Detection, Temporal Action Proposal Generation	Code Available
RCL: Recurrent Continuous Localization for Temporal Action Detection	Mar 14, 2022	Action Detection	Unverified
Context-LSTM: a robust classifier for video detection on UCF101	Mar 13, 2022	Action Detection, Action Recognition	Unverified
Human Attention Detection Using AM-FM Representations	Mar 9, 2022	Action Detection, Activity Detection	Unverified
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos	Mar 8, 2022	Action Detection, Activity Detection	Unverified
SegTAD: Precise Temporal Action Detection via Semantic Segmentation	Mar 3, 2022	Action Detection, object-detection	Unverified
Random Access with Massive MIMO-OTFS in LEO Satellite Communications	Feb 26, 2022	Action Detection, Activity Detection	Unverified
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition	Feb 22, 2022	Action Detection, Activity Detection	Unverified
Active Privacy-Utility Trade-off Against Inference in Time-Series Data Sharing	Feb 11, 2022	Action Detection, Activity Detection	Unverified
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge	Feb 10, 2022	Action Detection, Activity Detection	Unverified
Untrimmed Action Anticipation	Feb 8, 2022	Action Anticipation, Action Detection	Unverified
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge	Feb 6, 2022	Action Detection, Activity Detection	Unverified
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge	Feb 4, 2022	Action Detection, Activity Detection	Unverified
Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals	Jan 14, 2022	Action Detection, Activity Detection	Unverified
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization	Jan 6, 2022	Action Detection, Active Speaker Detection	Unverified
Merry Go Round: Rotate a Frame and Fool a DNN	Jan 1, 2022	Action Detection, Activity Detection	Unverified
Binary Image Skeletonization Using 2-Stage U-Net	Dec 22, 2021	Action Detection, Activity Detection	Unverified
Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark	Dec 16, 2021	Action Classification, Action Detection	Code Available
Low Resource Species Agnostic Bird Activity Detection	Dec 16, 2021	Action Detection, Activity Detection	Unverified
Two Stream Network for Stroke Detection in Table Tennis	Dec 16, 2021	Action Detection, Vocal Bursts Valence Prediction	Unverified


Datasets: UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results
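
The number after Frame-mAP or Video-mAP in these tables is an IoU threshold: a detection counts as correct only if its overlap with a ground-truth annotation of the same class reaches that value. Frame-mAP scores boxes frame by frame, while Video-mAP scores whole tubes. The sketch below uses one common definition of tube overlap (temporal IoU multiplied by the mean spatial IoU over shared frames). It is an assumption for illustration: the individual benchmarks may define overlap differently, and the function names and toy tubes are hypothetical.

```python
from typing import Dict, Tuple

Box = Tuple[float, float, float, float]  # assumed format: (x1, y1, x2, y2)


def box_iou(a: Box, b: Box) -> float:
    """Spatial IoU of two axis-aligned boxes (the overlap used by Frame-mAP)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def tube_iou(pred: Dict[int, Box], gt: Dict[int, Box]) -> float:
    """Spatio-temporal IoU of two tubes, each a {frame_index: box} mapping.

    Assumed definition: temporal IoU over frame indices, multiplied by the
    mean spatial IoU on the frames that both tubes cover.
    """
    shared = set(pred) & set(gt)
    if not shared:
        return 0.0
    temporal_iou = len(shared) / len(set(pred) | set(gt))
    mean_spatial_iou = sum(box_iou(pred[t], gt[t]) for t in shared) / len(shared)
    return temporal_iou * mean_spatial_iou


# Hypothetical tubes: ground truth spans frames 0-3, prediction spans frames 2-5
# and covers only the top half of the ground-truth box.
gt_tube = {t: (0.0, 0.0, 10.0, 10.0) for t in range(0, 4)}
pred_tube = {t: (0.0, 0.0, 10.0, 5.0) for t in range(2, 6)}

frame_overlap = box_iou(pred_tube[2], gt_tube[2])  # 0.5 on a shared frame
video_overlap = tube_iou(pred_tube, gt_tube)       # (2/6) * 0.5 = 1/6
```

In this toy case the prediction would pass a frame-level threshold of 0.5 on the shared frames but fail a video-level threshold of 0.2, which is one reason Frame-mAP and Video-mAP figures are not directly comparable.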

UCF101-24
#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

J-HMDB
#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

Charades
#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	I3D + biGRU + VS-ST-MPNN	mAP	23.7	—	Unverified

Multi-THUMOS
#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

UCF Sports
#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

THUMOS'14
#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

MultiSports
#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

TSU
#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

TTStroke-21 ME21
#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

TTStroke-21 ME22
#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

MultiTHUMOS
#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified