Action Detection

Action detection aims to find both where and when an action occurs within a video clip and to classify which action is taking place. Results are typically given as action tubelets: action bounding boxes linked across time in the video. The task is related to temporal localization, which seeks to identify the start and end frames of an action, and to action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.
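To make the idea of a tubelet concrete, the sketch below links per-frame detections into tubelets by greedily matching each box to the previous frame's box of the same class when their spatial IoU is high enough. This is an illustrative assumption, not the method of any paper listed on this page: the names `Tubelet`, `iou`, and `link_detections`, the `(x1, y1, x2, y2)` box layout, and the 0.5 threshold are all hypothetical choices made for the example.

```python
from dataclasses import dataclass, field

# Hypothetical box layout for this sketch: (x1, y1, x2, y2) in pixels.
Box = tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Spatial intersection-over-union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


@dataclass
class Tubelet:
    """One action instance: a class label plus one box per consecutive frame."""
    label: str
    start_frame: int
    boxes: list[Box] = field(default_factory=list)

    @property
    def end_frame(self) -> int:
        return self.start_frame + len(self.boxes) - 1


def link_detections(frames: list[list[tuple[str, Box]]],
                    min_iou: float = 0.5) -> list[Tubelet]:
    """Greedily link per-frame (label, box) detections into tubelets.

    A tubelet is extended when the current frame holds a detection of the
    same label whose IoU with the tubelet's last box is at least min_iou;
    otherwise the tubelet ends. Unmatched detections start new tubelets.
    """
    finished: list[Tubelet] = []
    active: list[Tubelet] = []
    for t, detections in enumerate(frames):
        unmatched = list(detections)
        still_active: list[Tubelet] = []
        for tube in active:
            best, best_iou = None, min_iou
            for det in unmatched:
                label, box = det
                if label != tube.label:
                    continue
                score = iou(tube.boxes[-1], box)
                if score >= best_iou:
                    best, best_iou = det, score
            if best is None:
                finished.append(tube)  # no continuation: the action ended
            else:
                tube.boxes.append(best[1])
                unmatched.remove(best)
                still_active.append(tube)
        for label, box in unmatched:
            still_active.append(Tubelet(label, t, [box]))
        active = still_active
    return finished + active


if __name__ == "__main__":
    frames = [
        [("run", (0, 0, 10, 10))],
        [("run", (1, 0, 11, 10))],
        [],
        [("jump", (50, 50, 60, 60))],
    ]
    for tube in link_detections(frames):
        print(tube.label, tube.start_frame, tube.end_frame)
```

Each returned tubelet answers all three questions in the task definition: its boxes say where, its start and end frames say when, and its label says what. Real systems usually add detection scores, tolerate short gaps, and use stronger matching than this greedy rule.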

Papers

Showing 651–700 of 817 papers

Title	Date	Tasks	Status
Identifying Visible Actions in Lifestyle Vlogs	Jun 10, 2019	Action Detection	Code Available
rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method	Jun 9, 2019	Action Detection, Activity Detection	Code Available
Two-Stream Region Convolutional 3D Network for Temporal Activity Detection	Jun 5, 2019	Action Detection, Action Recognition	Unverified
Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model	Jun 1, 2019	Action Detection, reinforcement-learning	Unverified
TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection	May 31, 2019	Action Detection	Unverified
Improving Action Localization by Progressive Cross-stream Cooperation	May 28, 2019	Action Classification, Action Detection	Unverified
Representation Learning on Visual-Symbolic Graphs for Video Understanding	May 17, 2019	Action Classification, Action Detection	Unverified
Follow the Attention: Combining Partial Pose and Object Motion for Fine-Grained Action Detection	May 11, 2019	Action Detection, Activity Detection	Unverified
Spatio-Temporal Action Localization in a Weakly Supervised Setting	May 6, 2019	Action Detection, Action Localization	Unverified
A Study on Action Detection in the Wild	Apr 29, 2019	Action Detection	Unverified
Simple yet efficient real-time pose-based action recognition	Apr 19, 2019	Action Detection, Action Recognition	Code Available
STEP: Spatio-Temporal Progressive Learning for Video Action Detection	Apr 19, 2019	Action Detection, Action Recognition	Code Available
Weakly Supervised Gaussian Networks for Action Detection	Apr 16, 2019	Action Detection, Action Localization	Unverified
Decoupling Localization and Classification in Single Shot Temporal Action Detection	Apr 16, 2019	Action Detection, Classification	Code Available
Dance with Flow: Two-in-One Stream Action Detection	Apr 1, 2019	Action Detection, Optical Flow Estimation	Code Available
Emotion Action Detection and Emotion Inference: the Task and Dataset	Mar 16, 2019	Action Detection, Emotion Classification	Code Available
COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis	Mar 7, 2019	Action Detection	Unverified
Towards Segmenting Anything That Moves	Feb 11, 2019	Action Detection, Instance Segmentation	Code Available
An Ensemble SVM-based Approach for Voice Activity Detection	Feb 5, 2019	Action Detection, Activity Detection	Unverified
Spatio-temporal Action Recognition: A Survey	Jan 27, 2019	Action Detection, Action Localization	Unverified
Actor Conditioned Attention Maps for Video Action Detection	Dec 30, 2018	Action Detection, Video Action Detection	Code Available
Similarity R-C3D for Few-shot Temporal Activity Detection	Dec 25, 2018	Action Detection, Activity Detection	Unverified
A Structured Model For Action Detection	Dec 9, 2018	Action Detection, model	Unverified
Tri-axial Self-Attention for Concurrent Activity Recognition	Dec 6, 2018	Action Detection, Activity Detection	Unverified
Computational Graph Approach for Detection of Composite Human Activities	Dec 5, 2018	Action Detection, Activity Detection	Unverified
Structure-Aware Convolutional Neural Networks	Dec 1, 2018	Action Detection, Action Recognition	Code Available
Discovering Spatio-Temporal Action Tubes	Nov 29, 2018	Action Detection	Unverified
Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection	Nov 21, 2018	Action Detection, Fine-Grained Action Detection	Code Available
A Proposal-Based Solution to Spatio-Temporal Action Detection in Untrimmed Videos	Nov 20, 2018	Action Classification, Action Detection	Unverified
Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection	Nov 19, 2018	Action Detection, Multiple Action Detection	Unverified
Temporal Recurrent Networks for Online Action Detection	Nov 18, 2018	Action Detection, Online Action Detection	Code Available
Recurrent Convolutions for Causal 3D CNNs	Nov 17, 2018	Action Detection	Unverified
BLP -- Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization	Nov 6, 2018	Action Detection, Action Localization	Unverified
Temporal Action Detection by Joint Identification-Verification	Oct 19, 2018	Action Detection	Unverified
Sequence Block based Compressed Sensing Multiuser Detection for 5G	Sep 28, 2018	Action Detection, Activity Detection	Unverified
End-to-end Audiovisual Speech Activity Detection with Bimodal Recurrent Neural Models	Sep 12, 2018	Action Detection, Activity Detection	Unverified
Recurrent Tubelet Proposal and Recognition Networks for Action Detection	Sep 1, 2018	Action Detection, Region Proposal	Unverified
AAD: Adaptive Anomaly Detection through traffic surveillance videos	Aug 29, 2018	Action Detection, Activity Detection	Unverified
Predicting Action Tubes	Aug 23, 2018	Action Classification, Action Detection	Unverified
Dynamic Temporal Pyramid Network: A Closer Look at Multi-Scale Modeling for Activity Detection	Aug 7, 2018	Action Detection, Activity Detection	Unverified
DFTerNet: Towards 2-bit Dynamic Fusion Networks for Accurate Human Activity Recognition	Jul 31, 2018	Action Detection, Activity Detection	Unverified
Action Detection from a Robot-Car Perspective	Jul 30, 2018	Action Detection, Activity Detection	Unverified
Actor-Centric Relation Network	Jul 28, 2018	Action Classification, Action Detection	Unverified
S3D: Single Shot multi-Span Detector via Fully 3D Convolutional Networks	Jul 21, 2018	Action Detection, Activity Detection	Code Available
Convolutional Neural Networks for Aerial Multi-Label Pedestrian Detection	Jul 16, 2018	Action Detection, Object	Unverified
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector	Jul 9, 2018	Action Detection, Temporal Localization	Unverified
A deep learning approach for understanding natural language commands for mobile service robots	Jul 9, 2018	Action Detection, Intent Detection	Unverified
Neural Dialogue Context Online End-of-Turn Detection	Jul 1, 2018	Action Detection, Spoken Dialogue Systems	Unverified
A flexible model for training action localization with varying levels of supervision	Jun 29, 2018	Action Detection, Action Localization	Code Available
Modality Distillation with Multiple Stream Networks for Action Recognition	Jun 19, 2018	Action Classification, Action Detection	Code Available

Datasets: UCF101-24, J-HMDB, Charades, Multi-THUMOS, UCF Sports, THUMOS'14, MultiSports, TSU, TTStroke-21 ME21, TTStroke-21 ME22, MultiTHUMOS

Benchmark Results

UCF101-24

#	Model	Metric	Claimed	Verified	Status
1	STAR/L	Frame-mAP 0.5	90.3	—	Unverified
2	SiA	Frame-mAP 0.5	88.5	—	Unverified
3	YOWO + LFB	Frame-mAP 0.5	87.3	—	Unverified
4	HIT	Frame-mAP 0.5	84.8	—	Unverified
5	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	82.3	—	Unverified
6	YOWO	Frame-mAP 0.5	80.4	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.2	78.48	—	Unverified
8	MOC	Frame-mAP 0.5	77.8	—	Unverified
9	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	76.3	—	Unverified
10	Two-in-one	Video-mAP 0.2	75.48	—	Unverified

J-HMDB

#	Model	Metric	Claimed	Verified	Status
1	SiA	Frame-mAP 0.5	88.5	—	Unverified
2	HISAN (ResNet-101 + FPN)	Video-mAP 0.2	87.59	—	Unverified
3	HIT	Frame-mAP 0.5	83.8	—	Unverified
4	HISAN (VGG-16)	Frame-mAP 0.5	76.72	—	Unverified
5	DTS	Video-mAP 0.2	76.1	—	Unverified
6	YOWO + LFB	Frame-mAP 0.5	75.7	—	Unverified
7	Two-in-one Two Stream	Video-mAP 0.5	74.74	—	Unverified
8	YOWO	Frame-mAP 0.5	74.4	—	Unverified
9	MOC	Frame-mAP 0.5	74	—	Unverified
10	Faster-RCNN + two-stream I3D conv	Frame-mAP 0.5	73.3	—	Unverified

Charades

#	Model	Metric	Claimed	Verified	Status
1	TTM	mAP	28.79	—	Unverified
2	CTRN	mAP	27.8	—	Unverified
3	Coarse-Fine Networks (w/ self-supervised detection pretraining)	mAP	26.95	—	Unverified
4	UniMD+Sync. (RGB+Flow)	mAP	26.53	—	Unverified
5	PDAN (RGB+Flow)	mAP	26.5	—	Unverified
6	PAT	mAP	26.5	—	Unverified
7	MS-TCT (RGB only)	mAP	25.4	—	Unverified
8	3D ResNet-50 + super-events pretrained on AViD	mAP	25.2	—	Unverified
9	Coarse-Fine Networks	mAP	25.1	—	Unverified
10	MLAD (RGB + Flow)	mAP	23.7	—	Unverified

Multi-THUMOS

#	Model	Metric	Claimed	Verified	Status
1	MLAD	mAP	51.5	—	Unverified
2	CTRN	mAP	51.2	—	Unverified
3	PDAN	mAP	47.6	—	Unverified
4	TGM	mAP	46.4	—	Unverified
5	MS-TCT (RGB only)	mAP	43.1	—	Unverified
6	I3D + our super-event	mAP	36.4	—	Unverified
7	Two-stream + LSTM	mAP	28.1	—	Unverified
8	Two-stream	mAP	27.6	—	Unverified

UCF Sports

#	Model	Metric	Claimed	Verified	Status
1	Two-in-one Two Stream	Video-mAP 0.5	96.52	—	Unverified
2	DTS	Video-mAP 0.2	94.3	—	Unverified
3	Two-in-one	Video-mAP 0.5	92.74	—	Unverified
4	T-CNN	Frame-mAP 0.5	86.7	—	Unverified
5	MR-TS R-CNN	Frame-mAP 0.5	84.52	—	Unverified
6	TS R-CNN	Frame-mAP 0.5	82.3	—	Unverified
7	Action Tubes	Frame-mAP 0.5	68.1	—	Unverified

THUMOS'14

#	Model	Metric	Claimed	Verified	Status
1	MAT (Ours) Trans	mAP	71.6	—	Unverified
2	TadML-two stream	mAP	59.7	—	Unverified
3	MAT (ours)	mAP	58.2	—	Unverified
4	TadML-rgb	mAP	53.46	—	Unverified

MultiSports

#	Model	Metric	Claimed	Verified	Status
1	HIT	Frame-mAP 0.5	33.3	—	Unverified
2	SiA	Frame-mAP 0.5	28.8	—	Unverified

TSU

#	Model	Metric	Claimed	Verified	Status
1	MS-TCT	Frame-mAP	33.7	—	Unverified
2	PDAN	Frame-mAP	32.7	—	Unverified

TTStroke-21 ME21

#	Model	Metric	Claimed	Verified	Status
1	STCNN	IoU	0.14	—	Unverified
2	Two Stream Network	IoU	0.07	—	Unverified

TTStroke-21 ME22

#	Model	Metric	Claimed	Verified	Status
1	STCNN-V2 (Vote decision)	IoU	0.52	—	Unverified
2	RGB and PRGB	IoU	0.35	—	Unverified

MultiTHUMOS

#	Model	Metric	Claimed	Verified	Status
1	PAT	mAP	44.6	—	Unverified