Action Detection

Action detection aims to find both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets, which are action bounding boxes linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.
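The idea of a tubelet as "bounding boxes linked across time" can be illustrated with a small sketch. The `Tubelet` class and the function names below are hypothetical and chosen only for illustration. The overlap measure shown (temporal IoU multiplied by the mean spatial IoU over shared frames) is one common convention for comparing a predicted tube with a ground-truth tube; individual benchmarks may define it differently.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# A box is (x1, y1, x2, y2) in pixel coordinates (an assumed convention).
Box = Tuple[float, float, float, float]


@dataclass
class Tubelet:
    """Hypothetical container: one action label plus one box per frame."""
    label: str
    boxes: Dict[int, Box]  # frame index -> bounding box


def box_iou(a: Box, b: Box) -> float:
    """Spatial intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def temporal_iou(a: Tubelet, b: Tubelet) -> float:
    """IoU of the two sets of frame indices covered by each tubelet."""
    frames_a, frames_b = set(a.boxes), set(b.boxes)
    union = frames_a | frames_b
    return len(frames_a & frames_b) / len(union) if union else 0.0


def tube_iou(a: Tubelet, b: Tubelet) -> float:
    """Temporal IoU times the mean spatial IoU over the shared frames."""
    shared = set(a.boxes) & set(b.boxes)
    if not shared:
        return 0.0
    spatial = sum(box_iou(a.boxes[f], b.boxes[f]) for f in shared) / len(shared)
    return temporal_iou(a, b) * spatial
```

Under this convention, a prediction would count as correct at a threshold such as 0.5 when its label matches the ground truth and `tube_iou` is at least that threshold.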

Papers

Showing 551–600 of 817 papers

Title	Date	Tasks	Status	Hype
The AFRL IWSLT 2020 Systems: Work-From-Home Edition	Jul 1, 2020	Action Detection, Activity Detection	Unverified	0
Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations	Jul 1, 2020	Action Detection, Activity Detection	Unverified	0
Video Representation Learning with Visual Tempo Consistency	Jun 28, 2020	Action Anticipation, Action Detection	Code Available	1
Rescaling Egocentric Vision	Jun 23, 2020	Action Anticipation, Action Detection	Code Available	1
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge	Jun 14, 2020	Action Detection, Activity Detection	Unverified	0
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization	Jun 14, 2020	Action Detection, Action Localization	Code Available	1
CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1)	Jun 13, 2020	Action Detection, Action Localization	Code Available	1
ESAD: Endoscopic Surgeon Action Detection Dataset	Jun 12, 2020	Action Detection	Unverified	0
Distributed Optimization for Massive Connectivity	Jun 10, 2020	Action Detection, Activity Detection	Unverified	0
audino: A Modern Annotation Tool for Audio and Speech	Jun 9, 2020	Action Detection, Activity Detection	Code Available	2
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos	Jun 5, 2020	Action Detection, Action Recognition	Unverified	0
Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features	May 25, 2020	Action Detection, Activity Detection	Unverified	0
Real-Time Radar-Based Gesture Detection and Recognition Built in an Edge-Computing Platform	May 20, 2020	Action Detection, Activity Detection	Unverified	0
Siamese Neural Networks for Class Activity Detection	May 15, 2020	Action Detection, Activity Detection	Unverified	0
Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario	May 14, 2020	Action Detection, Activity Detection	Unverified	0
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention	May 8, 2020	Action Detection, Activity Detection	Unverified	0
Spatio-Temporal Event Segmentation and Localization for Wildlife Extended Videos	May 5, 2020	Action Detection, Activity Detection	Unverified	0
A Benchmark for Structured Procedural Knowledge Extraction from Cooking Videos	May 2, 2020	Action Detection, Form	Code Available	1
The SAFE-T Corpus: A New Resource for Simulated Public Safety Communications	May 1, 2020	Action Detection, Activity Detection	Unverified	0
Semi-supervised Acoustic Modelling for Five-lingual Code-switched ASR using Automatically-segmented Soap Opera Speech	May 1, 2020	Acoustic Modelling, Action Detection	Unverified	0
Activity Detection from Wearable Electromyogram Sensors using Hidden Markov Model	Apr 27, 2020	Action Detection, Activity Detection	Unverified	0
Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos	Apr 23, 2020	Action Detection, Activity Detection	Unverified	0
TAEN: Temporal Aware Embedding Network for Few-Shot Action Recognition	Apr 21, 2020	3D Face Reconstruction, Action Detection	Unverified	0
Group Activity Detection from Trajectory and Video Data in Soccer	Apr 21, 2020	Action Detection, Activity Detection	Unverified	0
Asynchronous Interaction Aggregation for Action Detection	Apr 16, 2020	Action Detection, Video Action Detection	Code Available	1
ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos	Apr 15, 2020	Action Detection, Action Spotting	Unverified	0
Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech	Apr 8, 2020	Acoustic Modelling, Action Detection	Unverified	0
Progressive Boundary Refinement Network for Temporal Action Detection	Apr 3, 2020	Action Detection	Unverified	0
Two-Stream AMTnet for Action Detection	Apr 3, 2020	Action Detection, Autonomous Driving	Code Available	0
PaStaNet: Toward Human Activity Knowledge Engine	Apr 2, 2020	Action Detection, Human-Object Interaction Detection	Code Available	1
Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection	Apr 2, 2020	Action Detection, Activity Detection	Unverified	0
Spatio-Temporal Action Detection with Multi-Object Interaction	Apr 1, 2020	Action Detection, Human Detection	Unverified	0
Revisiting Few-shot Activity Detection with Class Similarity Control	Mar 31, 2020	Action Detection, Activity Detection	Unverified	0
Long Short-Term Relation Networks for Video Action Detection	Mar 31, 2020	Action Detection, Object	Unverified	0
Dual Attention in Time and Frequency Domain for Voice Activity Detection	Mar 27, 2020	Action Detection, Activity Detection	Code Available	0
Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol	Mar 26, 2020	Action Detection, Online Action Detection	Code Available	0
The Instantaneous Accuracy: a Novel Metric for the Problem of Online Human Behaviour Recognition in Untrimmed Videos	Mar 22, 2020	Action Detection, Online Action Detection	Code Available	0
Comprehensive Instructional Video Analysis: The COIN Dataset and Performance Evaluation	Mar 20, 2020	Action Detection	Unverified	0
A Novel Online Action Detection Framework from Untrimmed Video Streams	Mar 17, 2020	Action Detection, Action Localization	Unverified	0
ZSTAD: Zero-Shot Temporal Activity Detection	Mar 12, 2020	Action Detection, Activity Detection	Unverified	0
Cross modal video representations for weakly supervised active speaker localization	Mar 9, 2020	Action Detection, Active Speaker Localization	Unverified	0
Argus: Efficient Activity Detection System for Extended Video Analysis	Mar 2, 2020	Action Detection, Activity Detection	Unverified	0
DIHARD II is Still Hard: Experimental Results and Discussions from the DKU-LENOVO Team	Feb 23, 2020	Action Detection, Activity Detection	Unverified	0
Back to the Future: Joint Aware Temporal Deep Learning 3D Human Pose Estimation	Feb 22, 2020	3D Human Pose Estimation, Action Detection	Code Available	0
Harvesting Ambient RF for Presence Detection Through Deep Learning	Feb 13, 2020	Action Detection, Activity Detection	Code Available	1
Human Activity Recognition: A Spatio-temporal Image Encoding of 3D Skeleton Data for Online Action Detection	Feb 8, 2020	Action Detection, Activity Recognition	Code Available	0
3D ResNet with Ranking Loss Function for Abnormal Activity Detection in Videos	Feb 4, 2020	Action Detection, Action Recognition	Unverified	0
End-to-End Automatic Speech Recognition Integrated With CTC-Based Voice Activity Detection	Feb 3, 2020	Action Detection, Activity Detection	Unverified	0
Faster Activity and Data Detection in Massive Random Access: A Multi-armed Bandit Approach	Jan 28, 2020	Action Detection, Activity Detection	Unverified	0
A Comprehensive Study on Temporal Modeling for Online Action Detection	Jan 21, 2020	Action Detection, Online Action Detection	Code Available	0

Datasets: UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results

UCF101-24

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

J-HMDB

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

Charades

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

Multi-THUMOS

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

UCF Sports

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

THUMOS'14

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

MultiSports

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

TSU

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

TTStroke-21 ME21

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

TTStroke-21 ME22

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

MultiTHUMOS

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified