Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 551–600 of 817 papers

Title	Date	Tasks	Status
Understanding Policy and Technical Aspects of AI-Enabled Smart Video Surveillance to Address Public Safety	Feb 8, 2023	Action DetectionAnomaly Detection	—Unverified
Unfolding Videos Dynamics via Taylor Expansion	Sep 4, 2024	Action DetectionAction Recognition	—Unverified
Unified Graph Structured Models for Video Understanding	Mar 29, 2021	Action DetectionGraph Classification	—Unverified
Union of Low-Rank Subspaces Detector	Jul 29, 2013	Action DetectionActivity Detection	—Unverified
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection	Jan 7, 2025	Action DetectionActivity Detection	—Unverified
Unsupervised Action Proposal Ranking through Proposal Recombination	Apr 3, 2017	Action DetectionAction Recognition	—Unverified
Unsupervised Human Action Detection by Action Matching	Dec 2, 2016	Action DetectionActivity Recognition	—Unverified
Untrimmed Action Anticipation	Feb 8, 2022	Action AnticipationAction Detection	—Unverified
Unveiling ECC Vulnerabilities: LSTM Networks for Operation Recognition in Side-Channel Attacks	Feb 24, 2025	Action DetectionActivity Detection	—Unverified
Unveiling the Power of Complex-Valued Transformers in Wireless Communications	Feb 16, 2025	Action DetectionActivity Detection	—Unverified
User Activity Detection and Channel Estimation of Spatially Correlated Channels via AMP in Massive MTC	Dec 8, 2021	Action DetectionActivity Detection	—Unverified
User Activity Detection for Irregular Repetition Slotted Aloha based MMTC	Nov 11, 2021	Action DetectionActivity Detection	—Unverified
User Activity Detection with Delay-Calibration for Asynchronous Massive Random Access	Nov 4, 2024	Action DetectionActivity Detection	—Unverified
User Adaptive Restoration for Incorrectly-Segmented Utterances in Spoken Dialogue Systems	Sep 1, 2015	Action DetectionSpeech Recognition	—Unverified
Using joint angles based on the international biomechanical standards for human action recognition and related tasks	Jun 25, 2024	Action DetectionAction Recognition	—Unverified
USTC-NELSLIP System Description for DIHARD-III Challenge	Mar 19, 2021	Action DetectionActivity Detection	—Unverified
VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording	Jul 15, 2021	Action DetectionActivity Detection	—Unverified
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition	Feb 22, 2022	Action DetectionActivity Detection	—Unverified
VAST: A Corpus of Video Annotation for Speech Technologies	May 1, 2018	Action DetectionLanguage Identification	—Unverified
Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance	Jun 12, 2024	Action DetectionActivity Detection	—Unverified
Video Action Detection: Analysing Limitations and Challenges	Apr 17, 2022	Action DetectionVideo Action Detection	—Unverified
VideoCapsuleNet: A Simplified Network for Action Detection	May 21, 2018	Action ClassificationAction Detection	—Unverified
Video Event Detection by Exploiting Word Dependencies from Image Captions	Dec 1, 2016	Action DetectionEvent Detection	—Unverified
Video-guided Machine Translation with Spatial Hierarchical Attention Network	Aug 1, 2021	Action DetectionMachine Translation	—Unverified
vireoJD-MM at Activity Detection in Extended Videos	Jun 20, 2019	Action DetectionAction Localization	—Unverified
Voice Activity Detection for Transient Noisy Environment Based on Diffusion Nets	Jun 25, 2021	Action DetectionActivity Detection	—Unverified
Voice Activity Detection using Temporal Characteristics of Autocorrelation Lag and Maximum Spectral Amplitude in Sub-bands	Dec 1, 2014	Action DetectionActivity Detection	—Unverified
VOXLINGUA107: A DATASET FOR SPOKEN LANGUAGE RECOGNITION	Nov 25, 2020	Action DetectionActivity Detection	—Unverified
VSANet: Real-time Speech Enhancement Based on Voice Activity Detection and Causal Spatial Attention	Oct 11, 2023	Action DetectionActivity Detection	—Unverified
Watch Only Once: An End-to-End Video Action Detection Framework	Jan 1, 2021	Action ClassificationAction Detection	—Unverified
Weakly-Supervised Action Detection Guided by Audio Narration	May 12, 2022	Action Detection	—Unverified
Weakly Supervised Gaussian Networks for Action Detection	Apr 16, 2019	Action DetectionAction Localization	—Unverified
Whispy: Adapting STT Whisper Models to Real-Time Environments	May 6, 2024	Action DetectionActivity Detection	—Unverified
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos	Jun 5, 2020	Action DetectionAction Recognition	—Unverified
You Ought to Look Around: Precise, Large Span Action Detection	Oct 15, 2021	Action DetectionAction Localization	—Unverified
ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection	Nov 1, 2023	Action DetectionClassification	—Unverified
DASZL: Dynamic Action Signatures for Zero-shot Learning	Dec 8, 2019	Action DetectionActivity Detection	—Unverified
Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning	Apr 6, 2021	Action ClassificationAction Detection	—Unverified
A Proposed Artificial intelligence Model for Real-Time Human Action Localization and Tracking	Nov 9, 2019	Action DetectionAction Localization	—Unverified
Multi-Stream Single Shot Spatial-Temporal Action Detection	Aug 22, 2019	Action DetectionOptical Flow Estimation	—Unverified
Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention	May 8, 2020	Action DetectionActivity Detection	—Unverified
Multi-task Self-Supervised Learning for Human Activity Detection	Jul 27, 2019	Action DetectionActivity Detection	—Unverified
Multi-Task Sub-Band Network For Deep Residual Echo Suppression	Mar 11, 2023	Action DetectionActivity Detection	—Unverified
Multi-timescale Event Detection in Nonintrusive Load Monitoring based on MDL Principle	Nov 19, 2022	Action DetectionActivity Detection	—Unverified
Multi-timescale Trajectory Prediction for Abnormal Human Activity Detection	Aug 12, 2019	Action DetectionActivity Detection	—Unverified
Neural Dialogue Context Online End-of-Turn Detection	Jul 1, 2018	Action DetectionSpoken Dialogue Systems	—Unverified
Representation Learning on Visual-Symbolic Graphs for Video Understanding	May 17, 2019	Action ClassificationAction Detection	—Unverified
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining	Jan 6, 2025	Action DetectionActivity Detection	—Unverified
NTT Multi-Speaker ASR System for the DASR Task of CHiME-8 Challenge	Sep 9, 2024	Action DetectionActivity Detection	—Unverified
Nudge: Accelerating Overdue Pull Requests Towards Completion	Nov 25, 2020	Action DetectionActivity Detection	—Unverified

Show:10 25 50

← PrevPage 12 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	I3D + biGRU + VS-ST-MPNN	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified