Action Detection

Action detection aims to find both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets: bounding boxes of the action linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify the action and typically assumes a trimmed video.
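To make the idea of a tubelet concrete, here is a minimal sketch of linking per-frame bounding boxes across time. It assumes boxes are `(x1, y1, x2, y2)` tuples and uses a simple greedy intersection-over-union (IoU) match between consecutive frames. The names `iou`, `link_tubelet`, and `min_iou` are hypothetical and chosen for illustration; this is not the linking procedure of any particular paper listed here.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), an assumed layout


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def link_tubelet(frames: List[List[Box]], start: Box, min_iou: float = 0.5) -> List[Box]:
    """Greedily extend a tubelet from `start` (a box in frames[0]).

    In each later frame, pick the box that overlaps most with the last
    linked box; stop when no box reaches `min_iou`.
    """
    tube = [start]
    for boxes in frames[1:]:
        best = max(boxes, key=lambda b: iou(tube[-1], b), default=None)
        if best is None or iou(tube[-1], best) < min_iou:
            break  # the action is assumed to end here
        tube.append(best)
    return tube


# Hypothetical detections for four consecutive frames.
frames = [
    [(0, 0, 10, 10)],
    [(1, 1, 11, 11), (50, 50, 60, 60)],
    [(2, 2, 12, 12)],
    [(80, 80, 90, 90)],
]
tube = link_tubelet(frames, frames[0][0])
print(len(tube))  # number of frames the tubelet spans
```

The length of the returned list gives the temporal extent (when), the boxes give the spatial extent (where), and a classifier applied to the tubelet would supply the label (what). Real systems usually score links with detection confidence as well as overlap, which this sketch leaves out.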

Papers

Showing 101–150 of 817 papers

Title	Date	Tasks	Status	Hype
MMAD: Multi-label Micro-Action Detection in Videos	Jul 7, 2024	Action Analysis, Action Detection	Code Available	1
TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR	Jul 5, 2024	Action Detection, Activity Detection	Code Available	0
Micro-gesture Online Recognition using Learnable Query Points	Jul 5, 2024	Action Detection	Unverified	0
DyFADet: Dynamic Feature Aggregation for Temporal Action Detection	Jul 3, 2024	Action Detection, Dynamic neural networks	Code Available	1
Automatic Speech Recognition for Hindi	Jun 26, 2024	Action Detection, Activity Detection	Unverified	0
Benchmarking Deep Learning Models on NVIDIA Jetson Nano for Real-Time Systems: An Empirical Investigation	Jun 25, 2024	Action Detection, Benchmarking	Code Available	0
Using joint angles based on the international biomechanical standards for human action recognition and related tasks	Jun 25, 2024	Action Detection, Action Recognition	Unverified	0
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024	Jun 24, 2024	Action Detection, Activity Detection	Unverified	0
AnimalFormer: Multimodal Vision Framework for Behavior-based Precision Livestock Farming	Jun 14, 2024	Action Detection, Activity Detection	Unverified	0
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness	Jun 12, 2024	Action Detection, Activity Detection	Unverified	0
Vessel Re-identification and Activity Detection in Thermal Domain for Maritime Surveillance	Jun 12, 2024	Action Detection, Activity Detection	Unverified	0
Deep Learning-Based Approach for User Activity Detection with Grant-Free Random Access in Cell-Free Massive MIMO	Jun 11, 2024	Action Detection, Activity Detection	Unverified	0
An Effective-Efficient Approach for Dense Multi-Label Action Detection	Jun 10, 2024	Action Detection	Unverified	0
InaGVAD : a Challenging French TV and Radio Corpus Annotated for Speech Activity Detection and Speaker Gender Segmentation	Jun 6, 2024	Action Detection, Activity Detection	Code Available	1
Precise Analysis of Covariance Identifiability for Activity Detection in Grant-Free Random Access	Jun 3, 2024	Action Detection, Activity Detection	Unverified	0
Object Aware Egocentric Online Action Detection	Jun 3, 2024	Action Detection, Object	Unverified	0
Skeleton-OOD: An End-to-End Skeleton-Based Model for Robust Out-of-Distribution Human Action Detection	May 31, 2024	Action Detection, Action Recognition	Code Available	0
MALT: Multi-scale Action Learning Transformer for Online Action Detection	May 31, 2024	Action Detection, Decoder	Unverified	0
A Real-Time Voice Activity Detection Based On Lightweight Neural	May 27, 2024	Action Detection, Activity Detection	Unverified	0
Open-Vocabulary Spatio-Temporal Action Detection	May 17, 2024	Action Detection, Fine-Grained Action Detection	Unverified	0
Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization	May 15, 2024	Action Detection, Activity Detection	Unverified	0
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding	May 14, 2024	Action Detection, GPU	Code Available	1
A Semantic and Motion-Aware Spatiotemporal Transformer Network for Action Detection	May 13, 2024	Action Detection	Unverified	0
Whispy: Adapting STT Whisper Models to Real-Time Environments	May 6, 2024	Action Detection, Activity Detection	Unverified	0
Activity Detection for Massive Random Access using Covariance-based Matching Pursuit	May 4, 2024	Action Detection, Activity Detection	Unverified	0
One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features	Apr 30, 2024	Action Detection, Open-vocab Temporal Action Detection	Code Available	0
FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method	Apr 28, 2024	Action Detection, Activity Detection	Unverified	0
A Customer Level Fraudulent Activity Detection Benchmark for Enhancing Machine Learning Model Research and Evaluation	Apr 23, 2024	Action Detection, Activity Detection	Unverified	0
Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection	Apr 17, 2024	3D Object Detection, Action Detection	Unverified	0
STMixer: A One-Stage Sparse Action Detector	Apr 15, 2024	Action Detection	Unverified	0
TIM: A Time Interval Machine for Audio-Visual Action Recognition	Apr 8, 2024	Action Detection, Action Recognition	Code Available	2
UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection	Apr 7, 2024	Action Detection, Moment Queries	Code Available	2
TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression	Apr 3, 2024	Action Detection, object-detection	Code Available	1
Action Detection via an Image Diffusion Process	Apr 1, 2024	Action Detection, Image Generation	Unverified	0
Dual DETRs for Multi-Label Temporal Action Detection	Mar 31, 2024	Action Detection, object-detection	Unverified	0
Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions	Mar 29, 2024	Action Detection, Benchmarking	Code Available	1
Deep Learning-Assisted Parallel Interference Cancellation for Grant-Free NOMA in Machine-Type Communication	Mar 12, 2024	Action Detection, Activity Detection	Unverified	0
Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications	Mar 11, 2024	Action Detection, Activity Detection	Unverified	0
Detection of Object Throwing Behavior in Surveillance Videos	Mar 11, 2024	Action Detection, Anomaly Detection	Unverified	0
sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks	Mar 9, 2024	Action Detection, Activity Detection	Unverified	0
High-speed Low-consumption sEMG-based Transient-state micro-Gesture Recognition	Mar 4, 2024	Action Detection, Electromyography (EMG)	Unverified	0
Fast Low-parameter Video Activity Localization in Collaborative Learning Environments	Mar 2, 2024	Action Detection, Activity Detection	Unverified	0
Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access: A Free Probability Theory Approach	Feb 28, 2024	Action Detection, Activity Detection	Unverified	0
Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection	Feb 13, 2024	Action Detection, Activity Detection	Unverified	0
Device Activity Detection and Channel Estimation for Millimeter-Wave Massive MIMO	Feb 7, 2024	Action Detection, Activity Detection	Unverified	0
A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model	Feb 5, 2024	Action Detection, Activity Detection	Unverified	0
Joint User Detection and Localization in Near-Field Using Reconfigurable Intelligent Surfaces	Feb 4, 2024	Action Detection, Activity Detection	Unverified	0
Online speaker diarization of meetings guided by speech separation	Jan 30, 2024	Action Detection, Activity Detection	Code Available	1
Cutup and Detect: Human Fall Detection on Cutup Untrimmed Videos Using a Large Foundational Video Understanding Model	Jan 29, 2024	Action Detection, Action Localization	Unverified	0
Self-supervised New Activity Detection in Sensor-based Smart Environments	Jan 17, 2024	Action Detection, Activity Detection	Unverified	0

Datasets (the tables under Benchmark Results appear to follow this order, one table per dataset): UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	I3D + biGRU + VS-ST-MPNN	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified