Action Detection

Action detection aims to find both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets: action bounding boxes linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.
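To make the idea of "bounding boxes linked across time" concrete, here is a minimal sketch of greedy tubelet linking. It assumes per-frame boxes are already available as `(x1, y1, x2, y2)` tuples. The names `box_iou` and `link_tubelet` and the `min_iou` threshold are hypothetical, chosen for illustration. Published detectors generally use more elaborate linking that also takes class scores into account.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), an assumed layout


def box_iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def link_tubelet(frames: List[List[Box]], start: Box, min_iou: float = 0.3) -> List[Box]:
    """Greedily extend a tubelet from `start` (a box in frames[0]).

    In each later frame, pick the box that overlaps most with the previous
    box of the tubelet; stop when no box reaches `min_iou` (illustrative value).
    """
    tube = [start]
    for boxes in frames[1:]:
        best = max(boxes, key=lambda b: box_iou(tube[-1], b), default=None)
        if best is None or box_iou(tube[-1], best) < min_iou:
            break  # the action is assumed to end here
        tube.append(best)
    return tube


# Toy detections: one actor drifting right, a distractor, then nothing nearby.
frames = [
    [(0, 0, 10, 10)],
    [(1, 1, 11, 11), (50, 50, 60, 60)],
    [(2, 2, 12, 12)],
    [(80, 80, 90, 90)],
]
tube = link_tubelet(frames, frames[0][0])
print(len(tube))
```

The length of the returned list gives the temporal extent of the action (the "when"), and the boxes themselves give its location in each frame (the "where").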

Papers


Showing 1–10 of 817 papers

Title	Date	Tasks	Status	Hype
MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans	Jun 25, 2025	Action Detection, Benchmarking	Unverified	0
CBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency Assessment	Jun 25, 2025	Action Detection, Activity Detection	Unverified	0
Distributed Activity Detection for Cell-Free Hybrid Near-Far Field Communications	Jun 17, 2025	Action Detection, Activity Detection	Unverified	0
Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation Algorithm	Jun 3, 2025	Action Detection, Activity Detection	Code Available	1
Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion	Jun 2, 2025	Action Detection, Activity Detection	Unverified	0
Joint Activity Detection and Channel Estimation for Massive Connectivity: Where Message Passing Meets Score-Based Generative Priors	May 31, 2025	Action Detection, Activity Detection	Unverified	0
Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM	May 29, 2025	Action Detection, Activity Detection	Unverified	0
Robust Activity Detection for Massive Random Access	May 21, 2025	Action Detection, Activity Detection	Unverified	0
Improving endpoint detection in end-to-end streaming ASR for conversational speech	May 19, 2025	Action Detection, Activity Detection	Unverified	0
Multi-Stage Speaker Diarization for Noisy Classrooms	May 16, 2025	Action Detection, Activity Detection	Code Available	0


Datasets: UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified
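The number after each metric name (0.2 or 0.5) is conventionally read as an IoU threshold: a detection counts as correct only if its overlap with the ground truth reaches that value. For Video-mAP the overlap is measured between whole tubes. The sketch below uses one commonly used definition, temporal IoU multiplied by the mean spatial IoU over the shared frames. This definition is an assumption, since the table does not state which protocol each entry follows. The names `Tube` and `tube_iou` are hypothetical.

```python
from typing import Dict, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), an assumed layout
Tube = Dict[int, Box]                    # frame index -> box (hypothetical type)


def box_iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def tube_iou(pred: Tube, gt: Tube) -> float:
    """Spatio-temporal IoU under the assumed definition:
    (shared frames / all frames) * mean box IoU over the shared frames."""
    shared = set(pred) & set(gt)
    if not shared:
        return 0.0
    temporal_iou = len(shared) / len(set(pred) | set(gt))
    spatial_iou = sum(box_iou(pred[f], gt[f]) for f in shared) / len(shared)
    return temporal_iou * spatial_iou


# Same box in every frame, but the prediction starts two frames late.
box = (0.0, 0.0, 10.0, 10.0)
gt = {f: box for f in range(0, 4)}     # frames 0..3
pred = {f: box for f in range(2, 6)}   # frames 2..5
score = tube_iou(pred, gt)
print(score >= 0.2, score >= 0.5)
```

In this toy case the tubes share two of six frames, so the prediction would pass a 0.2 threshold but fail a 0.5 threshold even though every box is spatially perfect. Frame-mAP, by contrast, applies the threshold to individual per-frame boxes, so it does not penalize temporal misalignment in the same way.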