Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–550 of 817 papers

Title	Date	Tasks	Status	Hype
Smart Black Box 2.0: Efficient High-bandwidth Driving Data Collection based on Video Anomalies	Jan 3, 2021	Action DetectionAnomaly Detection	—Unverified	0
Watch Only Once: An End-to-End Video Action Detection Framework	Jan 1, 2021	Action ClassificationAction Detection	—Unverified	0
Towards Improving Spatiotemporal Action Recognition in Videos	Dec 15, 2020	Action DetectionAction Localization	CodeCode Available	0
AV Taris: Online Audio-Visual Speech Recognition	Dec 14, 2020	Action DetectionActivity Detection	CodeCode Available	1
Spatial-Temporal Alignment Network for Action Recognition and Detection	Dec 4, 2020	Action DetectionAction Recognition	—Unverified	0
MEVA: A Large-Scale Multiview, Multimodal Video Dataset for Activity Detection	Dec 2, 2020	Action DetectionActivity Detection	—Unverified	0
VoxLingua107: a Dataset for Spoken Language Recognition	Nov 25, 2020	Action DetectionActivity Detection	CodeCode Available	1
VOXLINGUA107: A DATASET FOR SPOKEN LANGUAGE RECOGNITION	Nov 25, 2020	Action DetectionActivity Detection	—Unverified	0
Nudge: Accelerating Overdue Pull Requests Towards Completion	Nov 25, 2020	Action DetectionActivity Detection	—Unverified	0
Temporal Action Detection with Multi-level Supervision	Nov 24, 2020	Action DetectionSemi-Supervised Action Detection	—Unverified	0
We don't Need Thousand Proposals Single Shot Actor-Action Detection in Videos	Nov 22, 2020	Action Detection	CodeCode Available	0
From Recognition to Prediction: Analysis of Human Action and Trajectory Prediction in Video	Nov 20, 2020	Action DetectionAutonomous Driving	CodeCode Available	1
Privileged Knowledge Distillation for Online Action Detection	Nov 18, 2020	Action DetectionKnowledge Distillation	—Unverified	0
A Time-Frequency based Suspicious Activity Detection for Anti-Money Laundering	Nov 17, 2020	Action DetectionActivity Detection	—Unverified	0
LAP-Net: Adaptive Features Sampling via Learning Action Progression for Online Action Detection	Nov 16, 2020	Action DetectionOnline Action Detection	—Unverified	0
SALAD: Self-Assessment Learning for Action Detection	Nov 13, 2020	Action DetectionAction Localization	—Unverified	0
Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection	Oct 28, 2020	Action DetectionActivity Detection	CodeCode Available	0
MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network for Voice Activity Detection	Oct 26, 2020	Action DetectionActivity Detection	—Unverified	0
Activity Detection And Modeling Using Smart Meter Data: Concept And Case Studies	Oct 26, 2020	Action DetectionActivity Detection	—Unverified	0
Multi-Channel Speaker Verification for Single and Multi-talker Speech	Oct 23, 2020	Action DetectionActivity Detection	—Unverified	0
Speech enhancement aided end-to-end multi-task learning for voice activity detection	Oct 23, 2020	Action DetectionActivity Detection	—Unverified	0
Combination of Deep Speaker Embeddings for Diarisation	Oct 22, 2020	Action DetectionActivity Detection	—Unverified	0
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge	Oct 22, 2020	Action DetectionActivity Detection	—Unverified	0
An Efficient Algorithm for Device Detection and Channel Estimation in Asynchronous IoT Systems	Oct 20, 2020	Action DetectionActivity Detection	—Unverified	0
Robust Two-Stream Multi-Feature Network for Driver Drowsiness Detection	Oct 13, 2020	Action Detectionimage-classification	—Unverified	0
The "Sound of Silence" in EEG -- Cognitive voice activity detection	Oct 12, 2020	Action DetectionActivity Detection	—Unverified	0
Online Action Detection in Streaming Videos with Time Buffers	Oct 6, 2020	Action DetectionOnline Action Detection	—Unverified	0
A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments	Oct 6, 2020	Action DetectionActivity Detection	—Unverified	0
Grant-Free Access via Bilinear Inference for Cell-Free MIMO with Low-Coherent Pilots	Sep 27, 2020	Action DetectionActivity Detection	—Unverified	0
Learning Visual Voice Activity Detection with an Automatically Annotated Dataset	Sep 23, 2020	Action DetectionActivity Detection	—Unverified	0
TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search & Retrieval	Sep 21, 2020	Action DetectionActivity Detection	—Unverified	0
On Multitask Loss Function for Audio Event Detection and Localization	Sep 11, 2020	Action DetectionActivity Detection	—Unverified	0
Massive Machine Type Communication Pilot-Hopping Sequence Detection Architectures Based on Non-Negative Least Squares for Grant-Free Random Access	Sep 4, 2020	Action DetectionActivity Detection	—Unverified	0
Online Spatiotemporal Action Detection and Prediction via Causal Representations	Aug 31, 2020	Action DetectionAction Recognition	CodeCode Available	0
Finding Action Tubes with a Sparse-to-Dense Framework	Aug 30, 2020	Action Detection	—Unverified	0
RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns	Aug 21, 2020	Action DetectionActivity Detection	CodeCode Available	0
SegCodeNet: Color-Coded Segmentation Masks for Activity Detection from Wearable Cameras	Aug 19, 2020	Action DetectionActivity Detection	—Unverified	0
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization	Aug 19, 2020	Action DetectionAction Localization	—Unverified	0
MLNET: An Adaptive Multiple Receptive-field Attention Neural Network for Voice Activity Detection	Aug 13, 2020	Action DetectionActivity Detection	—Unverified	0
A Multi-Task Learning Approach for Human Activity Segmentation and Ergonomics Risk Assessment	Aug 7, 2020	Action DetectionDecoder	CodeCode Available	0
Multi-Level Temporal Pyramid Network for Action Detection	Aug 7, 2020	Action Detection	—Unverified	0
Jointly Sparse Signal Recovery and Support Recovery via Deep Learning with Applications in MIMO-based Grant-Free Random Access	Aug 5, 2020	Action DetectionActivity Detection	—Unverified	0
Boundary Content Graph Neural Network for Temporal Action Proposal Generation	Aug 4, 2020	Action DetectionAction Understanding	—Unverified	0
"This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II)	Aug 4, 2020	Action DetectionActivity Detection	—Unverified	0
Towards Efficient Coarse-to-Fine Networks for Action and Gesture Recognition	Aug 1, 2020	3D Action RecognitionAction Classification	—Unverified	0
Weight Excitation: Built-in Attention Mechanisms in Convolutional Neural Networks	Aug 1, 2020	3D Action Recognition3D Classification	CodeCode Available	1
Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments	Jul 28, 2020	Action DetectionActivity Detection	—Unverified	0
Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos	Jul 21, 2020	Action DetectionAction Recognition	—Unverified	0
Context-Aware RCNN: A Baseline for Action Detection in Videos	Jul 20, 2020	Action DetectionAction Recognition	CodeCode Available	1
AViD Dataset: Anonymized Videos from Diverse Countries	Jul 10, 2020	Action ClassificationAction Detection	CodeCode Available	1

Show:10 25 50

← PrevPage 11 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified