Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 551–600 of 817 papers

Title	Date	Tasks	Status
The "Sound of Silence" in EEG -- Cognitive voice activity detection	Oct 12, 2020	Action DetectionActivity Detection	—Unverified
Online Action Detection in Streaming Videos with Time Buffers	Oct 6, 2020	Action DetectionOnline Action Detection	—Unverified
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments	Oct 6, 2020	Action DetectionActivity Detection	—Unverified
Grant-Free Access via Bilinear Inference for Cell-Free MIMO with Low-Coherent Pilots	Sep 27, 2020	Action DetectionActivity Detection	—Unverified
Learning Visual Voice Activity Detection with an Automatically Annotated Dataset	Sep 23, 2020	Action DetectionActivity Detection	—Unverified
TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search & Retrieval	Sep 21, 2020	Action DetectionActivity Detection	—Unverified
On Multitask Loss Function for Audio Event Detection and Localization	Sep 11, 2020	Action DetectionActivity Detection	—Unverified
Massive Machine Type Communication Pilot-Hopping Sequence Detection Architectures Based on Non-Negative Least Squares for Grant-Free Random Access	Sep 4, 2020	Action DetectionActivity Detection	—Unverified
Online Spatiotemporal Action Detection and Prediction via Causal Representations	Aug 31, 2020	Action DetectionAction Recognition	CodeCode Available
Finding Action Tubes with a Sparse-to-Dense Framework	Aug 30, 2020	Action Detection	—Unverified
RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns	Aug 21, 2020	Action DetectionActivity Detection	CodeCode Available
SegCodeNet: Color-Coded Segmentation Masks for Activity Detection from Wearable Cameras	Aug 19, 2020	Action DetectionActivity Detection	—Unverified
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization	Aug 19, 2020	Action DetectionAction Localization	—Unverified
MLNET: An Adaptive Multiple Receptive-field Attention Neural Network for Voice Activity Detection	Aug 13, 2020	Action DetectionActivity Detection	—Unverified
A Multi-Task Learning Approach for Human Activity Segmentation and Ergonomics Risk Assessment	Aug 7, 2020	Action DetectionDecoder	CodeCode Available
Multi-Level Temporal Pyramid Network for Action Detection	Aug 7, 2020	Action Detection	—Unverified
Jointly Sparse Signal Recovery and Support Recovery via Deep Learning with Applications in MIMO-based Grant-Free Random Access	Aug 5, 2020	Action DetectionActivity Detection	—Unverified
"This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II)	Aug 4, 2020	Action DetectionActivity Detection	—Unverified
Boundary Content Graph Neural Network for Temporal Action Proposal Generation	Aug 4, 2020	Action DetectionAction Understanding	—Unverified
Towards Efficient Coarse-to-Fine Networks for Action and Gesture Recognition	Aug 1, 2020	3D Action RecognitionAction Classification	—Unverified
Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments	Jul 28, 2020	Action DetectionActivity Detection	—Unverified
Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos	Jul 21, 2020	Action DetectionAction Recognition	—Unverified
The AFRL IWSLT 2020 Systems: Work-From-Home Edition	Jul 1, 2020	Action DetectionActivity Detection	—Unverified
Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations	Jul 1, 2020	Action DetectionActivity Detection	—Unverified
The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge	Jun 14, 2020	Action DetectionActivity Detection	—Unverified
ESAD: Endoscopic Surgeon Action Detection Dataset	Jun 12, 2020	Action Detection	—Unverified
Distributed Optimization for Massive Connectivity	Jun 10, 2020	Action DetectionActivity Detection	—Unverified
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos	Jun 5, 2020	Action DetectionAction Recognition	—Unverified
Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features	May 25, 2020	Action DetectionActivity Detection	—Unverified
Real-Time Radar-Based Gesture Detection and Recognition Built in an Edge-Computing Platform	May 20, 2020	Action DetectionActivity Detection	—Unverified
Siamese Neural Networks for Class Activity Detection	May 15, 2020	Action DetectionActivity Detection	—Unverified
Target-Speaker Voice Activity Detection: a Novel Approach for Multi-Speaker Diarization in a Dinner Party Scenario	May 14, 2020	Action DetectionActivity Detection	—Unverified
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention	May 8, 2020	Action DetectionActivity Detection	—Unverified
Spatio-Temporal Event Segmentation and Localization for Wildlife Extended Videos	May 5, 2020	Action DetectionActivity Detection	—Unverified
The SAFE-T Corpus: A New Resource for Simulated Public Safety Communications	May 1, 2020	Action DetectionActivity Detection	—Unverified
Semi-supervised Acoustic Modelling for Five-lingual Code-switched ASR using Automatically-segmented Soap Opera Speech	May 1, 2020	Acoustic ModellingAction Detection	—Unverified
Activity Detection from Wearable Electromyogram Sensors using Hidden Markov Model	Apr 27, 2020	Action DetectionActivity Detection	—Unverified
Gabriella: An Online System for Real-Time Activity Detection in Untrimmed Security Videos	Apr 23, 2020	Action DetectionActivity Detection	—Unverified
TAEN: Temporal Aware Embedding Network for Few-Shot Action Recognition	Apr 21, 2020	3D Face ReconstructionAction Detection	—Unverified
Group Activity Detection from Trajectory and Video Data in Soccer	Apr 21, 2020	Action DetectionActivity Detection	—Unverified
ActionSpotter: Deep Reinforcement Learning Framework for Temporal Action Spotting in Videos	Apr 15, 2020	Action DetectionAction Spotting	—Unverified
Semi-supervised acoustic modelling for five-lingual code-switched ASR using automatically-segmented soap opera speech	Apr 8, 2020	Acoustic ModellingAction Detection	—Unverified
Progressive Boundary Refinement Network for Temporal Action Detection	Apr 3, 2020	Action Detection	—Unverified
Two-Stream AMTnet for Action Detection	Apr 3, 2020	Action DetectionAutonomous Driving	CodeCode Available
Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection	Apr 2, 2020	Action DetectionActivity Detection	—Unverified
Spatio-Temporal Action Detection with Multi-Object Interaction	Apr 1, 2020	Action DetectionHuman Detection	—Unverified
Revisiting Few-shot Activity Detection with Class Similarity Control	Mar 31, 2020	Action DetectionActivity Detection	—Unverified
Long Short-Term Relation Networks for Video Action Detection	Mar 31, 2020	Action DetectionObject	—Unverified
Dual Attention in Time and Frequency Domain for Voice Activity Detection	Mar 27, 2020	Action DetectionActivity Detection	CodeCode Available
Rethinking Online Action Detection in Untrimmed Videos: A Novel Online Evaluation Protocol	Mar 26, 2020	Action DetectionOnline Action Detection	CodeCode Available

Show:10 25 50

← PrevPage 12 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified