Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–450 of 817 papers

Title	Date	Tasks	Status	Hype
Access Delay Constrained Activity Detection in Massive Random Access	Nov 4, 2021	Action DetectionActivity Detection	—Unverified	0
Revisiting spatio-temporal layouts for compositional action recognition	Nov 2, 2021	Action ClassificationAction Detection	CodeCode Available	1
AVASpeech-SMAD: A Strongly Labelled Speech and Music Activity Detection Dataset with Label Co-Occurrence	Nov 2, 2021	Action DetectionActivity Detection	CodeCode Available	1
whu-nercms at trecvid2021:instance search task	Oct 30, 2021	Action DetectionFace Detection	—Unverified	0
Self-Denoising Neural Networks for Few Shot Learning	Oct 26, 2021	Action DetectionDenoising	—Unverified	0
CTRN: Class-Temporal Relational Network for Action Detection	Oct 26, 2021	Action Detection	—Unverified	0
AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation	Oct 21, 2021	Action DetectionTemporal Action Proposal Generation	CodeCode Available	1
LSTC: Boosting Atomic Action Detection with Long-Short-Term Context	Oct 19, 2021	Action DetectionAction Recognition	CodeCode Available	1
You Ought to Look Around: Precise, Large Span Action Detection	Oct 15, 2021	Action DetectionAction Localization	—Unverified	0
Object-Region Video Transformers	Oct 13, 2021	Action DetectionAction Recognition	CodeCode Available	1
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications	Oct 12, 2021	Action DetectionActivity Detection	CodeCode Available	1
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR	Oct 7, 2021	Action DetectionActivity Detection	—Unverified	0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition	Oct 7, 2021	Action DetectionActivity Detection	—Unverified	0
Deep Learning-based Action Detection in Untrimmed Videos: A Survey	Sep 30, 2021	Action DetectionAction Recognition	—Unverified	0
Information Elevation Network for Fast Online Action Detection	Sep 28, 2021	Action DetectionAction Recognition	—Unverified	0
The VVAD-LRS3 Dataset for Visual Voice Activity Detection	Sep 28, 2021	Action DetectionActivity Detection	—Unverified	0
Towards High-Quality Temporal Action Detection with Sparse Proposals	Sep 18, 2021	Action DetectionAvg	CodeCode Available	1
The Stackelberg Equilibrium for One-sided Zero-sum Partially Observable Stochastic Games	Sep 17, 2021	Action Detection	—Unverified	0
Learning to Discriminate Information for Online Action Detection: Analysis and Application	Sep 8, 2021	Action AnticipationAction Detection	—Unverified	0
Class Semantics-based Attention for Action Detection	Sep 6, 2021	Action DetectionAction Localization	—Unverified	0
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge	Sep 5, 2021	Action DetectionActivity Detection	—Unverified	0
Identity-aware Graph Memory Network for Action Detection	Aug 26, 2021	Action DetectionGraph Neural Network	—Unverified	0
Sparse Signal Processing for Massive Connectivity via Mixed-Integer Programming	Aug 20, 2021	Action DetectionActivity Detection	—Unverified	0
Classification of Abnormal Hand Movement for Aiding in Autism Detection: Machine Learning Study	Aug 18, 2021	Action DetectionActivity Detection	CodeCode Available	1
Learning an Augmented RGB Representation with Cross-Modal Knowledge Distillation for Action Detection	Aug 8, 2021	Action DetectionKnowledge Distillation	—Unverified	0
Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker	Aug 7, 2021	Action DetectionActivity Detection	—Unverified	0
Video-guided Machine Translation with Spatial Hierarchical Attention Network	Aug 1, 2021	Action DetectionMachine Translation	—Unverified	0
Fine-Grained Classroom Activity Detection from Audio with Neural Networks	Jul 29, 2021	Action DetectionActivity Detection	CodeCode Available	0
Spot What Matters: Learning Context Using Graph Convolutional Networks for Weakly-Supervised Action Detection	Jul 28, 2021	Action DetectionHuman-Object Interaction Detection	CodeCode Available	1
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording	Jul 15, 2021	Action DetectionActivity Detection	—Unverified	0
Joint Activity Detection, Channel Estimation, and Data Decoding for Grant-free Massive Random Access	Jul 12, 2021	Action DetectionActivity Detection	—Unverified	0
RGB Stream Is Enough for Temporal Action Detection	Jul 9, 2021	Action DetectionData Augmentation	CodeCode Available	1
Long Short-Term Transformer for Online Action Detection	Jul 7, 2021	Action DetectionDecoder	CodeCode Available	1
SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection	Jun 29, 2021	Action Detection	—Unverified	0
Spatio-Temporal Context for Action Detection	Jun 29, 2021	Action DetectionVideo Understanding	—Unverified	0
Exploring Temporal Context and Human Movement Dynamics for Online Action Detection in Videos	Jun 26, 2021	Action DetectionAction Localization	—Unverified	0
Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets	Jun 25, 2021	Action DetectionActivity Detection	—Unverified	0
Dealing with training and test segmentation mismatch: FBK@IWSLT2021	Jun 23, 2021	Action DetectionActivity Detection	—Unverified	0
OadTR: Online Action Detection with Transformers	Jun 21, 2021	Action DetectionDecoder	CodeCode Available	1
EML Online Speech Activity Detection for the Fearless Steps Challenge Phase-III	Jun 21, 2021	Action DetectionActivity Detection	—Unverified	0
Proposal Relation Network for Temporal Action Detection	Jun 20, 2021	Action ClassificationAction Detection	CodeCode Available	1
Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection	Jun 19, 2021	Action DetectionPseudo Label	—Unverified	0
Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations	Jun 19, 2021	Action DetectionAction Localization	—Unverified	0
Algorithm Unrolling for Massive Access via Deep Neural Network with Theoretical Guarantee	Jun 19, 2021	Action DetectionActivity Detection	—Unverified	0
End-to-end Temporal Action Detection with Transformer	Jun 18, 2021	Action DetectionTemporal Action Localization	CodeCode Available	1
MaCLR: Motion-aware Contrastive Learning of Representations for Videos	Jun 17, 2021	Action DetectionAction Recognition	CodeCode Available	0
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection	Jun 16, 2021	Action DetectionAction Understanding	—Unverified	0
Relation Modeling in Spatio-Temporal Action Localization	Jun 15, 2021	Action DetectionAction Localization	—Unverified	0
A Stronger Baseline for Ego-Centric Action Detection	Jun 13, 2021	Action DetectionVideo Action Detection	—Unverified	0
WASE: Learning When to Attend for Speaker Extraction in Cocktail Party Environments	Jun 13, 2021	Action DetectionActivity Detection	CodeCode Available	1

Show:10 25 50

← PrevPage 9 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified