Action Detection

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–400 of 817 papers

Title	Date	Tasks	Status	Hype
Automated speech tools for helping communities process restricted-access corpora for language revival efforts	Apr 15, 2022	Action DetectionActivity Detection	—Unverified	0
Anomalous Sound Detection Based on Machine Activity Detection	Apr 15, 2022	Action DetectionActivity Detection	—Unverified	0
CholecTriplet2021: A benchmark challenge for surgical action triplet recognition	Apr 10, 2022	Action DetectionAction Triplet Recognition	CodeCode Available	1
E^2TAD: An Energy-Efficient Tracking-based Action Detector	Apr 9, 2022	Action DetectionAction Localization	CodeCode Available	1
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition	Apr 8, 2022	Action DetectionActivity Detection	—Unverified	0
An Empirical Study of End-to-End Temporal Action Detection	Apr 6, 2022	Action ClassificationAction Detection	CodeCode Available	1
Faster-TAD: Towards Temporal Action Detection with Proposal Generation and Classification in a Unified Network	Apr 6, 2022	Action DetectionAction Spotting	—Unverified	0
Low-Latency Speech Separation Guided Diarization for Telephone Conversations	Apr 5, 2022	Action DetectionActivity Detection	CodeCode Available	1
Gan-Based Joint Activity Detection and Channel Estimation For Grant-free Random Access	Apr 4, 2022	Action DetectionActivity Detection	CodeCode Available	0
Stochastic Backpropagation: A Memory Efficient Strategy for Training Video Models	Mar 31, 2022	Action DetectionAction Recognition	CodeCode Available	1
Deep Learning for Encrypted Traffic Classification and Unknown Data Detection	Mar 25, 2022	Action DetectionActivity Detection	—Unverified	0
Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios	Mar 18, 2022	Action DetectionActivity Detection	—Unverified	0
ABN: Agent-Aware Boundary Networks for Temporal Action Proposal Generation	Mar 16, 2022	Action DetectionTemporal Action Proposal Generation	CodeCode Available	0
RCL: Recurrent Continuous Localization for Temporal Action Detection	Mar 14, 2022	Action Detection	—Unverified	0
Context-LSTM: a robust classifier for video detection on UCF101	Mar 13, 2022	Action DetectionAction Recognition	—Unverified	0
Human Attention Detection Using AM-FM Representations	Mar 9, 2022	Action DetectionActivity Detection	—Unverified	0
PAMI-AD: An Activity Detector Exploiting Part-attention and Motion Information in Surveillance Videos	Mar 8, 2022	Action DetectionActivity Detection	—Unverified	0
End-to-End Semi-Supervised Learning for Video Action Detection	Mar 8, 2022	Action DetectionClassification Consistency	CodeCode Available	1
SegTAD: Precise Temporal Action Detection via Semantic Segmentation	Mar 3, 2022	Action Detectionobject-detection	—Unverified	0
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars	Mar 2, 2022	Action DetectionOnline Action Detection	CodeCode Available	2
Random Access with Massive MIMO-OTFS in LEO Satellite Communications	Feb 26, 2022	Action DetectionActivity Detection	—Unverified	0
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition	Feb 22, 2022	Action DetectionActivity Detection	—Unverified	0
Active Privacy-Utility Trade-off Against Inference in Time-Series Data Sharing	Feb 11, 2022	Action DetectionActivity Detection	—Unverified	0
The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge	Feb 10, 2022	Action DetectionActivity Detection	—Unverified	0
Untrimmed Action Anticipation	Feb 8, 2022	Action AnticipationAction Detection	—Unverified	0
Cross-Channel Attention-Based Target Speaker Voice Activity Detection: Experimental Results for M2MeT Challenge	Feb 6, 2022	Action DetectionActivity Detection	—Unverified	0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge	Feb 4, 2022	Action DetectionActivity Detection	—Unverified	0
HGCN: Harmonic gated compensation network for speech enhancement	Jan 30, 2022	Action DetectionActivity Detection	CodeCode Available	1
NAS-VAD: Neural Architecture Search for Voice Activity Detection	Jan 22, 2022	Action DetectionActivity Detection	CodeCode Available	1
Continual Transformers: Redundancy-Free Attention for Online Inference	Jan 17, 2022	Action DetectionAudio Classification	CodeCode Available	1
Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals	Jan 14, 2022	Action DetectionActivity Detection	—Unverified	0
Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization	Jan 6, 2022	Action DetectionActive Speaker Detection	—Unverified	0
Exploiting Temporal Side Information in Massive IoT Connectivity	Jan 5, 2022	Action DetectionActivity Detection	CodeCode Available	1
Merry Go Round: Rotate a Frame and Fool a DNN	Jan 1, 2022	Action DetectionActivity Detection	—Unverified	0
Binary Image Skeletonization Using 2-Stage U-Net	Dec 22, 2021	Action DetectionActivity Detection	—Unverified	0
Two Stream Network for Stroke Detection in Table Tennis	Dec 16, 2021	Action DetectionVocal Bursts Valence Prediction	—Unverified	0
Spatio-Temporal CNN baseline method for the Sports Video Task of MediaEval 2021 benchmark	Dec 16, 2021	Action ClassificationAction Detection	CodeCode Available	0
Sports Video: Fine-Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2021	Dec 16, 2021	Action DetectionFine-Grained Action Detection	CodeCode Available	0
Low Resource Species Agnostic Bird Activity Detection	Dec 16, 2021	Action DetectionActivity Detection	—Unverified	0
SVIP: Sequence VerIfication for Procedures in Videos	Dec 13, 2021	Action DetectionAction Recognition	CodeCode Available	1
Continuous Human Action Detection Based on Wearable Inertial Data	Dec 11, 2021	Action DetectionGesture Recognition	—Unverified	0
X-Vector based voice activity detection for multi-genre broadcast speech-to-text	Dec 9, 2021	Action DetectionActivity Detection	CodeCode Available	1
User Activity Detection and Channel Estimation of Spatially Correlated Channels via AMP in Massive MTC	Dec 8, 2021	Action DetectionActivity Detection	—Unverified	0
DCAN: Improving Temporal Action Detection via Dual Context Aggregation	Dec 7, 2021	Action DetectionTemporal Action Localization	CodeCode Available	1
MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection	Dec 7, 2021	Action DetectionTemporal Action Localization	CodeCode Available	1
Learning Proximal Operator Methods for Massive Connectivity in IoT Networks	Dec 6, 2021	Action DetectionActivity Detection	—Unverified	0
Reformulating Zero-shot Action Recognition for Multi-label Actions	Dec 1, 2021	Action ClassificationAction Detection	—Unverified	0
Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information	Nov 28, 2021	Action DetectionActivity Detection	CodeCode Available	0
Weakly-guided Self-supervised Pretraining for Temporal Activity Detection	Nov 26, 2021	Action DetectionActivity Detection	CodeCode Available	0
User Activity Detection for Irregular Repetition Slotted Aloha based MMTC	Nov 11, 2021	Action DetectionActivity Detection	—Unverified	0

Show:10 25 50

← PrevPage 8 of 17Next →

All datasets UCF101-24 J-HMDB Charades Multi-THUMOS UCF Sports THUMOS' 14 MultiSports TSU TTStroke-21 ME21 TTStroke-21 ME22 MultiTHUMOS

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	I3D + biGRU + VS-ST-MPNN	mAP	23.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified