Action Detection

Action detection aims to find both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets, which are action bounding boxes linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.
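To make the idea of a tubelet concrete, the sketch below represents one as a mapping from frame index to bounding box and compares two tubelets with a spatio-temporal IoU. The names `Tubelet`, `box_iou`, and `tubelet_iou` are hypothetical, and the overlap definition used here (temporal IoU multiplied by the mean per-frame box IoU over shared frames) is one common convention assumed for illustration; individual benchmarks may define it differently.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Assumed box layout: (x1, y1, x2, y2) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class Tubelet:
    """Hypothetical container: one action label plus per-frame boxes."""
    label: str
    boxes: Dict[int, Box]  # frame index -> bounding box


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def tubelet_iou(p: Tubelet, q: Tubelet) -> float:
    """Temporal IoU times mean spatial IoU over shared frames (assumed convention)."""
    shared = p.boxes.keys() & q.boxes.keys()
    if not shared:
        return 0.0
    all_frames = p.boxes.keys() | q.boxes.keys()
    temporal = len(shared) / len(all_frames)
    spatial = sum(box_iou(p.boxes[f], q.boxes[f]) for f in shared) / len(shared)
    return temporal * spatial


# Illustrative data only: a prediction on frames 0-3 and ground truth on frames 2-5.
pred = Tubelet("run", {f: (0.0, 0.0, 10.0, 10.0) for f in range(0, 4)})
truth = Tubelet("run", {f: (0.0, 0.0, 10.0, 10.0) for f in range(2, 6)})
score = tubelet_iou(pred, truth)
```

Under this convention, a metric such as "Video-mAP 0.2" would count a predicted tubelet as correct when its label matches and its overlap with a ground-truth tubelet is at least 0.2, while "Frame-mAP 0.5" would apply `box_iou` with a 0.5 threshold frame by frame.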

Papers


Showing 151–200 of 817 papers

Title	Date	Tasks	Status
Joint Activity Detection and Channel Estimation for Massive Connectivity: Where Message Passing Meets Score-Based Generative Priors	May 31, 2025	Action Detection, Activity Detection	Unverified
Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM	May 29, 2025	Action Detection, Activity Detection	Unverified
Robust Activity Detection for Massive Random Access	May 21, 2025	Action Detection, Activity Detection	Unverified
Improving endpoint detection in end-to-end streaming ASR for conversational speech	May 19, 2025	Action Detection, Activity Detection	Unverified
Multi-Stage Speaker Diarization for Noisy Classrooms	May 16, 2025	Action Detection, Activity Detection	Code Available
Beyond Pixels: Leveraging the Language of Soccer to Improve Spatio-Temporal Action Detection in Broadcast Videos	May 14, 2025	Action Detection, Decoder	Unverified
Sensing Framework Design and Performance Optimization with Action Detection for ISCC	May 5, 2025	Action Detection	Unverified
Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection	Apr 20, 2025	Action Detection, Decoder	Unverified
MicroNAS: An Automated Framework for Developing a Fall Detection System	Apr 10, 2025	Action Detection, Activity Detection	Unverified
Scaling Open-Vocabulary Action Detection	Apr 4, 2025	Action Detection, Multiple Action Detection	Code Available
FDDet: Frequency-Decoupling for Boundary Refinement in Temporal Action Detection	Apr 1, 2025	Action Detection, Relation Network	Unverified
Temporal Action Detection Model Compression by Progressive Block Drop	Mar 21, 2025	Action Detection, Autonomous Driving	Unverified
Fast MLE and MAPE-Based Device Activity Detection for Grant-Free Access via PSCA and PSCA-Net	Mar 19, 2025	Action Detection, Activity Detection	Unverified
ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing	Mar 17, 2025	Action Detection, Disaster Response	Unverified
Lightweight Learning for Grant-Free Activity Detection in Cell-Free Massive MIMO Networks	Mar 14, 2025	Action Detection, Activity Detection	Unverified
Federated Learning for Secure and Efficient Device Activity Detection in mMTC Networks	Mar 14, 2025	Action Detection, Activity Detection	Unverified
Robust Learning-Based Sparse Recovery for Device Activity Detection in Grant-Free Random Access Cell-Free Massive MIMO: Enhancing Resilience to Impairments	Mar 13, 2025	Action Detection, Activity Detection	Unverified
CADDI: An in-Class Activity Detection Dataset using IMU data from low-cost sensors	Mar 4, 2025	Action Detection, Activity Detection	Unverified
Optimizing Large Language Models for ESG Activity Detection in Financial Texts	Feb 28, 2025	Action Detection, Activity Detection	Code Available
Mixture of Experts-augmented Deep Unfolding for Activity Detection in IRS-aided Systems	Feb 27, 2025	Action Detection, Activity Detection	Unverified
Unveiling ECC Vulnerabilities: LSTM Networks for Operation Recognition in Side-Channel Attacks	Feb 24, 2025	Action Detection, Activity Detection	Unverified
Game State and Spatio-temporal Action Detection in Soccer using Graph Neural Networks and 3D Convolutional Networks	Feb 21, 2025	Action Detection	Unverified
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems	Feb 19, 2025	Action Detection, Activity Detection	Unverified
FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems	Feb 19, 2025	Action Detection, Activity Detection	Unverified
Unveiling the Power of Complex-Valued Transformers in Wireless Communications	Feb 16, 2025	Action Detection, Activity Detection	Unverified
DT4ECG: A Dual-Task Learning Framework for ECG-Based Human Identity Recognition and Human Activity Detection	Feb 16, 2025	Action Detection, Activity Detection	Unverified
Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge	Feb 14, 2025	Action Detection, Activity Detection	Unverified
When do they StOP?: A First Step Towards Automatically Identifying Team Communication in the Operating Room	Feb 12, 2025	Action Detection, Activity Detection	Code Available
Pre-Equalization Aided Grant-Free Massive Access in Massive MIMO System	Feb 10, 2025	Action Detection, Activity Detection	Code Available
An Automated Machine Learning Framework for Surgical Suturing Action Detection under Class Imbalance	Feb 10, 2025	Action Detection	Unverified
Deconstruct Complexity (DeComplex): A Novel Perspective on Tackling Dense Action Detection	Jan 30, 2025	Action Detection, Contrastive Learning	Unverified
Automatic detection and prediction of nAMD activity change in retinal OCT using Siamese networks and Wasserstein Distance for ordinality	Jan 24, 2025	Action Detection, Activity Detection	Code Available
Text-driven Online Action Detection	Jan 23, 2025	Action Detection, Autonomous Driving	Code Available
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection	Jan 7, 2025	Action Detection, Activity Detection	Unverified
Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining	Jan 6, 2025	Action Detection, Activity Detection	Unverified
Fotheidil: an Automatic Transcription System for the Irish Language	Dec 31, 2024	Action Detection, Activity Detection	Unverified
Action-Agnostic Point-Level Supervision for Temporal Action Detection	Dec 30, 2024	Action Detection	Code Available
Dataset for Real-World Human Action Detection Using FMCW mmWave Radar	Dec 23, 2024	Action Detection, Privacy Preserving	Unverified
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts	Dec 18, 2024	Action Detection, Descriptive	Code Available
Stable Mean Teacher for Semi-supervised Video Action Detection	Dec 10, 2024	Action Detection, Semantic Segmentation	Code Available
Comparative Analysis of Deep Learning Approaches for Harmful Brain Activity Detection Using EEG	Dec 10, 2024	Action Detection, Activity Detection	Unverified
Asynchronous Random Access in Massive MIMO Systems Facilitated by the Delay-Angle Domain	Dec 6, 2024	Action Detection, Activity Detection	Unverified
Continual Low-Rank Scaled Dot-product Attention	Dec 4, 2024	Action Detection, Audio Classification	Unverified
Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment	Dec 1, 2024	Action Detection, Activity Detection	Code Available
Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation	Nov 21, 2024	Action Detection, Activity Detection	Unverified
Transferable Adversarial Attacks against ASR	Nov 14, 2024	Action Detection, Activity Detection	Unverified
A Flexible Framework for Grant-Free Random Access in Cell-Free Massive MIMO Systems	Nov 14, 2024	Action Detection, Activity Detection	Unverified
On the Detection of Non-Cooperative RISs: Scan B-Testing via Deep Support Vector Data Description	Nov 5, 2024	Action Detection, Activity Detection	Unverified
Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization	Nov 4, 2024	Action Detection, Activity Detection	Unverified
Intelligent Video Recording Optimization using Activity Detection for Surveillance Systems	Nov 4, 2024	Action Detection, Activity Detection	Unverified



Datasets: UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results

UCF101-24
#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

J-HMDB
#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

Charades
#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

Multi-THUMOS
#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

UCF Sports
#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

THUMOS'14
#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

MultiSports
#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

TSU
#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

TTStroke-21 ME21
#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

TTStroke-21 ME22
#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

MultiTHUMOS
#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified