Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–400 of 817 papers

Title	Date	Tasks	Status
Baseline Method for the Sport Task of MediaEval 2022 with 3D CNNs using Attention Mechanisms	Feb 6, 2023	Action ClassificationAction Detection	CodeCode Available
Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022	Jan 31, 2023	Action DetectionBenchmarking	CodeCode Available
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description	Jan 17, 2023	Action DetectionActivity Detection	—Unverified
Deep learning-based approaches for human motion decoding in smart walkers for rehabilitation	Jan 13, 2023	Action DetectionAction Recognition	—Unverified
KIDS: kinematics-based (in)activity detection and segmentation in a sleep case study	Jan 4, 2023	Action DetectionActivity Detection	—Unverified
Ego-Only: Egocentric Action Detection without Exocentric Transferring	Jan 3, 2023	Action DetectionAction Localization	—Unverified
SkeleTR: Towards Skeleton-based Action Recognition in the Wild	Jan 1, 2023	Action ClassificationAction Detection	—Unverified
Hybrid Active Learning via Deep Clustering for Video Action Detection	Jan 1, 2023	Action DetectionActive Learning	—Unverified
Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection	Jan 1, 2023	Action Detection	—Unverified
Activity Detection for Grant-Free NOMA in Massive IoT Networks	Dec 23, 2022	Action DetectionActivity Detection	—Unverified
Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features	Dec 20, 2022	Action DetectionOptical Flow Estimation	—Unverified
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks	Dec 14, 2022	Action DetectionActivity Detection	—Unverified
Trajectory-User Linking Is Easier Than You Think	Dec 14, 2022	Action DetectionActivity Detection	—Unverified
Contextual Explainable Video Representation: Human Perception-based Understanding	Dec 12, 2022	Action DetectionAction Recognition	CodeCode Available
BC-VAD: A Robust Bone Conduction Voice Activity Detection	Dec 6, 2022	Action DetectionActivity Detection	—Unverified
Proximal Gradient-Based Unfolding for Massive Random Access in IoT Networks	Dec 4, 2022	Action DetectionActivity Detection	—Unverified
Joint Estimation of Clustered User Activity and Correlated Channels with Unknown Covariance in mMTC	Nov 30, 2022	Action DetectionActivity Detection	—Unverified
Multi-timescale Event Detection in Nonintrusive Load Monitoring based on MDL Principle	Nov 19, 2022	Action DetectionActivity Detection	—Unverified
On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches	Nov 16, 2022	Action DetectionActivity Detection	—Unverified
Token Turing Machines	Nov 16, 2022	Action DetectionActivity Detection	—Unverified
Two-stream Multi-dimensional Convolutional Network for Real-time Violence Detection	Nov 8, 2022	Action DetectionActivity Detection	—Unverified
OFDM-Based Massive Connectivity for LEO Satellite Internet of Things	Oct 31, 2022	Action DetectionActivity Detection	—Unverified
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition	Oct 28, 2022	Action DetectionActivity Detection	—Unverified
Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction	Oct 28, 2022	Action DetectionActivity Detection	—Unverified
Handwashing Action Detection System for an Autonomous Social Robot	Oct 27, 2022	Action DetectionAction Recognition	CodeCode Available
TSUP Speaker Diarization System for Conversational Short-phrase Speaker Diarization Challenge	Oct 26, 2022	Action DetectionActivity Detection	—Unverified
Refining Action Boundaries for One-stage Detection	Oct 25, 2022	Action Detection	CodeCode Available
mRI: Multi-modal 3D Human Pose Estimation Dataset using mmWave, RGB-D, and Inertial Sensors	Oct 15, 2022	3D Human Pose EstimationAction Detection	—Unverified
Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization	Oct 14, 2022	Action DetectionActive Speaker Detection	—Unverified
Application-Driven AI Paradigm for Hand-Held Action Detection	Oct 13, 2022	Action DetectionObject	—Unverified
The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022	Oct 4, 2022	Action DetectionActivity Detection	—Unverified
Learnable Acoustic Frontends in Bird Activity Detection	Oct 3, 2022	Action DetectionActivity Detection	—Unverified
Signed Latent Factors for Spamming Activity Detection	Sep 28, 2022	Action DetectionActivity Detection	—Unverified
RALACs: Action Recognition in Autonomous Vehicles using Interaction Encoding and Optical Flow	Sep 28, 2022	Action ClassificationAction Detection	CodeCode Available
Joint Speech Activity and Overlap Detection with Multi-Exit Architecture	Sep 24, 2022	Action DetectionActivity Detection	—Unverified
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022	Sep 23, 2022	Action DetectionActivity Detection	—Unverified
Cross-domain Voice Activity Detection with Self-Supervised Representations	Sep 22, 2022	Action DetectionActivity Detection	—Unverified
GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge	Sep 21, 2022	Action DetectionActivity Detection	—Unverified
Exploring Modulated Detection Transformer as a Tool for Action Recognition in Videos	Sep 21, 2022	Action DetectionAction Recognition	CodeCode Available
Hardware Accelerator and Neural Network Co-Optimization for Ultra-Low-Power Audio Processing Devices	Sep 8, 2022	Action DetectionActivity Detection	—Unverified
Spatio-Temporal Action Detection Under Large Motion	Sep 6, 2022	Action Detection	CodeCode Available
A Circular Window-based Cascade Transformer for Online Action Detection	Aug 30, 2022	Action DetectionAction Segmentation	—Unverified
Target Speaker Voice Activity Detection with Transformers and Its Integration with End-to-End Neural Diarization	Aug 27, 2022	Action DetectionActivity Detection	—Unverified
Actor-identified Spatiotemporal Action Detection --- Detecting Who Is Doing What in Videos	Aug 27, 2022	Action ClassificationAction Detection	CodeCode Available
Enabling Weakly-Supervised Temporal Action Localization from On-Device Learning of the Video Stream	Aug 25, 2022	Action DetectionAction Localization	—Unverified
Review on Action Recognition for Accident Detection in Smart City Transportation Systems	Aug 20, 2022	Action DetectionAction Recognition	—Unverified
Weakly Supervised Online Action Detection for Infant General Movements	Aug 7, 2022	Action DetectionClassification	CodeCode Available
P2ANet: A Dataset and Benchmark for Dense Action Detection from Table Tennis Match Broadcasting Videos	Jul 26, 2022	Action DetectionAction Localization	—Unverified
Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art Evaluation	Jul 26, 2022	Action DetectionDescriptive	—Unverified
An Efficient Spatio-Temporal Pyramid Transformer for Action Detection	Jul 21, 2022	Action DetectionVideo Understanding	—Unverified

Show:10 25 50

← PrevPage 8 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	I3D + biGRU + VS-ST-MPNN	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified