Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 451–500 of 817 papers

Title	Date	Tasks	Status
Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021	Dec 16, 2021	Action DetectionFine-Grained Action Detection	CodeCode Available
Continuous Human Action Detection Based on Wearable Inertial Data	Dec 11, 2021	Action DetectionGesture Recognition	—Unverified
User Activity Detection and Channel Estimation of Spatially Correlated Channels via AMP in Massive MTC	Dec 8, 2021	Action DetectionActivity Detection	—Unverified
Learning Proximal Operator Methods for Massive Connectivity in IoT Networks	Dec 6, 2021	Action DetectionActivity Detection	—Unverified
Reformulating Zero-shot Action Recognition for Multi-label Actions	Dec 1, 2021	Action ClassificationAction Detection	—Unverified
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information	Nov 28, 2021	Action DetectionActivity Detection	CodeCode Available
Weakly-guided Self-supervised Pretraining for Temporal Activity Detection	Nov 26, 2021	Action DetectionActivity Detection	CodeCode Available
User Activity Detection for Irregular Repetition Slotted Aloha based MMTC	Nov 11, 2021	Action DetectionActivity Detection	—Unverified
Access Delay Constrained Activity Detection in Massive Random Access	Nov 4, 2021	Action DetectionActivity Detection	—Unverified
whu-nercms at trecvid2021:instance search task	Oct 30, 2021	Action DetectionFace Detection	—Unverified
Self-Denoising Neural Networks for Few Shot Learning	Oct 26, 2021	Action DetectionDenoising	—Unverified
CTRN: Class-Temporal Relational Network for Action Detection	Oct 26, 2021	Action Detection	—Unverified
You Ought to Look Around: Precise, Large Span Action Detection	Oct 15, 2021	Action DetectionAction Localization	—Unverified
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR	Oct 7, 2021	Action DetectionActivity Detection	—Unverified
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition	Oct 7, 2021	Action DetectionActivity Detection	—Unverified
Deep Learning-based Action Detection in Untrimmed Videos: A Survey	Sep 30, 2021	Action DetectionAction Recognition	—Unverified
Information Elevation Network for Fast Online Action Detection	Sep 28, 2021	Action DetectionAction Recognition	—Unverified
The VVAD-LRS3 Dataset for Visual Voice Activity Detection	Sep 28, 2021	Action DetectionActivity Detection	—Unverified
The Stackelberg Equilibrium for One-sided Zero-sum Partially Observable Stochastic Games	Sep 17, 2021	Action Detection	—Unverified
Learning to Discriminate Information for Online Action Detection: Analysis and Application	Sep 8, 2021	Action AnticipationAction Detection	—Unverified
Class Semantics-based Attention for Action Detection	Sep 6, 2021	Action DetectionAction Localization	—Unverified
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge	Sep 5, 2021	Action DetectionActivity Detection	—Unverified
Identity-aware Graph Memory Network for Action Detection	Aug 26, 2021	Action DetectionGraph Neural Network	—Unverified
Sparse Signal Processing for Massive Connectivity via Mixed-Integer Programming	Aug 20, 2021	Action DetectionActivity Detection	—Unverified
Learning an Augmented RGB Representation with Cross-Modal Knowledge Distillation for Action Detection	Aug 8, 2021	Action DetectionKnowledge Distillation	—Unverified
Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker	Aug 7, 2021	Action DetectionActivity Detection	—Unverified
Video-guided Machine Translation with Spatial Hierarchical Attention Network	Aug 1, 2021	Action DetectionMachine Translation	—Unverified
Fine-Grained Classroom Activity Detection from Audio with Neural Networks	Jul 29, 2021	Action DetectionActivity Detection	CodeCode Available
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording	Jul 15, 2021	Action DetectionActivity Detection	—Unverified
Joint Activity Detection, Channel Estimation, and Data Decoding for Grant-free Massive Random Access	Jul 12, 2021	Action DetectionActivity Detection	—Unverified
Spatio-Temporal Context for Action Detection	Jun 29, 2021	Action DetectionVideo Understanding	—Unverified
SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection	Jun 29, 2021	Action Detection	—Unverified
Exploring Temporal Context and Human Movement Dynamics for Online Action Detection in Videos	Jun 26, 2021	Action DetectionAction Localization	—Unverified
Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets	Jun 25, 2021	Action DetectionActivity Detection	—Unverified
Dealing with training and test segmentation mismatch: FBK@IWSLT2021	Jun 23, 2021	Action DetectionActivity Detection	—Unverified
EML Online Speech Activity Detection for the Fearless Steps Challenge Phase-III	Jun 21, 2021	Action DetectionActivity Detection	—Unverified
Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection	Jun 19, 2021	Action DetectionPseudo Label	—Unverified
Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations	Jun 19, 2021	Action DetectionAction Localization	—Unverified
Algorithm Unrolling for Massive Access via Deep Neural Network with Theoretical Guarantee	Jun 19, 2021	Action DetectionActivity Detection	—Unverified
MaCLR: Motion-aware Contrastive Learning of Representations for Videos	Jun 17, 2021	Action DetectionAction Recognition	CodeCode Available
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection	Jun 16, 2021	Action DetectionAction Understanding	—Unverified
Relation Modeling in Spatio-Temporal Action Localization	Jun 15, 2021	Action DetectionAction Localization	—Unverified
A Stronger Baseline for Ego-Centric Action Detection	Jun 13, 2021	Action DetectionVideo Action Detection	—Unverified
Joint Channel Estimation and Device Activity Detection in Heterogeneous Networks	May 27, 2021	Action DetectionActivity Detection	—Unverified
PLSM: A Parallelized Liquid State Machine for Unintentional Action Detection	May 6, 2021	Action DetectionGPU	CodeCode Available
Accelerating Coordinate Descent via Active Set Selection for Device Activity Detection for Multi-Cell Massive Random Access	Apr 27, 2021	Action DetectionActivity Detection	—Unverified
Joint Activity Detection and Data Decoding in Massive Random Access via a Turbo Receiver	Apr 26, 2021	Action DetectionActivity Detection	—Unverified
Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation	Apr 23, 2021	Action DetectionActivity Detection	—Unverified
Spatial Correlation Aware Compressed Sensing for User Activity Detection and Channel Estimation in Massive MTC	Apr 17, 2021	Action DetectionActivity Detection	—Unverified
Spatiotemporal Deformable Scene Graphs for Complex Activity Detection	Apr 16, 2021	Action DetectionActivity Detection	—Unverified

Show:10 25 50

← PrevPage 10 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified