Action Segmentation

Action Segmentation is a challenging problem in high-level video understanding. In its simplest form, Action Segmentation aims to segment a temporally untrimmed video along the time axis and label each segment with one of a set of pre-defined action labels. The results of Action Segmentation can then be used as input to other applications, such as video-to-text and action localization.

Source: TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation
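
To make the definition above concrete, here is a minimal sketch, assuming per-frame action labels are already available (for example, from a frame-wise classifier). The function name `frames_to_segments` and the label strings are hypothetical and chosen only for illustration; they do not come from any of the listed papers.

```python
from itertools import groupby
from typing import List, Tuple

# (label, start_frame, end_frame_exclusive)
Segment = Tuple[str, int, int]


def frames_to_segments(frame_labels: List[str]) -> List[Segment]:
    """Collapse per-frame labels into contiguous labeled segments."""
    segments: List[Segment] = []
    start = 0
    # groupby yields one group per run of identical consecutive labels.
    for label, run in groupby(frame_labels):
        length = sum(1 for _ in run)
        segments.append((label, start, start + length))
        start += length
    return segments


# Hypothetical frame-wise labels for a 10-frame untrimmed clip.
frame_labels = ["background"] * 2 + ["cut_tomato"] * 5 + ["mix_salad"] * 3
segments = frames_to_segments(frame_labels)
for label, start, end in segments:
    print(f"{label}: frames [{start}, {end})")
```

Each tuple is one temporal segment with its action label, which is the output form the task description refers to.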

Papers

Showing 76–100 of 219 papers

Title	Date	Tasks	Status
Leveraging Hierarchical Parametric Networks for Skeletal Joints Based Action Segmentation and Recognition	Jun 1, 2014	Action Recognition, Action Segmentation	Unverified
Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences	Jan 29, 2020	Action Recognition, Action Segmentation	Unverified
Action Segmentation with Mixed Temporal Domain Adaptation	Apr 15, 2021	Action Segmentation, Domain Adaptation	Unverified
Continuous Human Action Recognition for Human-Machine Interaction: A Review	Feb 26, 2022	Action Recognition, Action Segmentation	Unverified
Leveraging Action Affinity and Continuity for Semi-supervised Temporal Action Segmentation	Jul 18, 2022	Action Segmentation, Temporal Action Segmentation	Unverified
Markov Game Video Augmentation for Action Segmentation	Jan 1, 2023	Action Segmentation, Data Augmentation	Unverified
Condensing Action Segmentation Datasets via Generative Network Inversion	Mar 18, 2025	Action Segmentation, Incremental Learning	Unverified
Coherent Temporal Synthesis for Incremental Action Segmentation	Mar 10, 2024	Action Recognition, Action Segmentation	Unverified
A Generalized & Robust Framework For Timestamp Supervision in Temporal Action Segmentation	Jul 20, 2022	Action Segmentation, TAG	Unverified
CASR: Refining Action Segmentation via Marginalizing Frame-levle Causal Relationships	Nov 21, 2023	Action Segmentation, Causal Discovery	Unverified
HOIST-Former: Hand-held Objects Identification Segmentation and Tracking in the Wild	Jan 1, 2024	Action Segmentation, Segmentation	Unverified
HOIST-Former: Hand-held Objects Identification, Segmentation, and Tracking in the Wild	Apr 22, 2024	Action Segmentation, Segmentation	Unverified
ADL4D: Towards A Contextually Rich Dataset for 4D Activities of Daily Living	Feb 27, 2024	Action Segmentation, Object	Unverified
Learning a Weakly-Supervised Video Actor-Action Segmentation Model with a Wise Selection	Mar 29, 2020	Action Segmentation, Segmentation	Unverified
C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action Segmentation	Dec 20, 2022	Action Segmentation, Decoder	Unverified
Human Action Segmentation With Hierarchical Supervoxel Consistency	Jun 1, 2015	Action Classification, Action Segmentation	Unverified
Human Action Sequence Classification	Oct 7, 2019	Action Classification, Action Localization	Unverified
Improving action segmentation via explicit similarity measurement	Feb 15, 2025	Action Segmentation, Boundary Detection	Unverified
Improving Action Segmentation via Graph-Based Temporal Reasoning	Jun 1, 2020	Action Segmentation, Relation	Unverified
A Circular Window-based Cascade Transformer for Online Action Detection	Aug 30, 2022	Action Detection, Action Segmentation	Unverified
Hierarchical Attention Network for Action Segmentation	May 7, 2020	Action Segmentation, Segmentation	Unverified
Towards Weakly Supervised End-to-end Learning for Long-video Action Recognition	Nov 28, 2023	Action Classification, Action Recognition	Unverified
A Hybrid RNN-HMM Approach for Weakly Supervised Temporal Action Segmentation	Jun 3, 2019	Action Recognition, Action Segmentation	Unverified
Hand Guided High Resolution Feature Enhancement for Fine-Grained Atomic Action Segmentation within Complex Human Assemblies	Nov 24, 2022	Action Classification, Action Recognition	Unverified
Grasp Type Revisited: A Modern Perspective on a Classical Feature for Vision	Jun 1, 2015	Action Segmentation, Action Understanding	Unverified

Datasets (listed in the order of the benchmark tables below): Breakfast, 50 Salads, GTEA, COIN, Assembly101, JIGSAWS, Youtube INRIA Instructional, 50Salads, MPII Cooking 2 Dataset

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AdaFocus (newly extracted I3D-features, LT-Context model)	Average F1	76.2	—	Unverified
2	FACT (efficient hybrid of convolution and transformer model)	Average F1	74.7	—	Unverified
3	ASQuery	Average F1	74.6	—	Unverified
4	BIT	Average F1	73.7	—	Unverified
5	DiffAct	Average F1	73.6	—	Unverified
6	BaFormer	Average F1	72.4	—	Unverified
7	CETNet	Average F1	71.8	—	Unverified
8	SF-TMN(ASFormer)	Average F1	71.6	—	Unverified
9	RF++-SSTDA	Acc	70.8	—	Unverified
10	ASPnet	Average F1	70.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Br-Prompt+ASPnet (RGB, flow, accelerometer)	F1@50%	88.5	—	Unverified
2	Semantic2Graph	F1@50%	87.3	—	Unverified
3	BaFormer	F1@50%	83.9	—	Unverified
4	DiffAct	F1@50%	83.7	—	Unverified
5	SF-TMN(ASFormer)	F1@50%	82.9	—	Unverified
6	LTContext	F1@50%	82	—	Unverified
7	UVAST	F1@50%	81.7	—	Unverified
8	Br-Prompt+ASFormer	F1@50%	81.3	—	Unverified
9	EUT	F1@50%	81	—	Unverified
10	CETNet	F1@50%	80.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Semantic2Graph	F1@50%	91.3	—	Unverified
2	FACT	F1@50%	87.5	—	Unverified
3	DiffAct	F1@50%	84.7	—	Unverified
4	BaFormer	F1@50%	83.5	—	Unverified
5	SF-TMN(ASFormer)	F1@50%	83.1	—	Unverified
6	Br-Prompt+ASFormer	F1@50%	83	—	Unverified
7	DPRN	F1@50%	82.9	—	Unverified
8	BIT	F1@50%	82.6	—	Unverified
9	CETNet	F1@50%	81.3	—	Unverified
10	UVAST	F1@50%	81	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UnLoc-L	Frame accuracy	72.8	—	Unverified
2	Univl	Frame accuracy	70	—	Unverified
3	Norton	Frame accuracy	69.8	—	Unverified
4	VideoClip	Frame accuracy	68.7	—	Unverified
5	TACo	Frame accuracy	68.4	—	Unverified
6	VLM	Frame accuracy	68.4	—	Unverified
7	MIL-NCE	Frame accuracy	61	—	Unverified
8	ActBERT	Frame accuracy	57	—	Unverified
9	CBT	Frame accuracy	53.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ASQuery	F1@10%	37.8	—	Unverified
2	LTContext	F1@10%	33.9	—	Unverified
3	ASFormer	F1@10%	33.4	—	Unverified
4	C2F-TCN	F1@10%	33.3	—	Unverified
5	UVAST	F1@10%	32.1	—	Unverified
6	MS-TCN++	F1@10%	31.6	—	Unverified
7	ProTAS(Offline)	F1@10%	28.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RL+Tree	Edit Distance	88.53	—	Unverified
2	RL (full)	Edit Distance	87.96	—	Unverified
3	TricorNet	Edit Distance	86.8	—	Unverified
4	SDL+SC-CRF	Edit Distance	86.21	—	Unverified
5	TCN	Edit Distance	83.1	—	Unverified
6	ST-CNN+Seg	Edit Distance	66.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TSA (FINCH)	Acc	62.4	—	Unverified
2	TSA (Kmeans)	Acc	59.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	EUT	Acc	87.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Unsup. TW-FINCH (K=avg/activity)	Accuracy	42	—	Unverified
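
The tables above report frame accuracy, segmental edit distance (edit score), and F1 at an overlap threshold (for example F1@10% and F1@50%). The sketch below follows commonly used definitions of these metrics and assumes non-empty, equal-length sequences of per-frame labels. The function names are hypothetical, and individual papers in the tables may compute their numbers differently (for instance in how background frames are handled), so this is an illustration, not the evaluation code behind the listed results.

```python
from itertools import groupby
from typing import List, Sequence, Tuple


def to_segments(labels: Sequence[str]) -> List[Tuple[str, int, int]]:
    """Runs of identical labels as (label, start, end_exclusive)."""
    segs, start = [], 0
    for label, run in groupby(labels):
        n = sum(1 for _ in run)
        segs.append((label, start, start + n))
        start += n
    return segs


def frame_accuracy(pred: Sequence[str], gt: Sequence[str]) -> float:
    """Percentage of frames whose predicted label matches the ground truth."""
    return 100.0 * sum(p == g for p, g in zip(pred, gt)) / len(gt)


def edit_score(pred: Sequence[str], gt: Sequence[str]) -> float:
    """100 * (1 - normalized Levenshtein distance) over segment label order."""
    p = [k for k, _ in groupby(pred)]
    g = [k for k, _ in groupby(gt)]
    prev = list(range(len(g) + 1))
    for i, a in enumerate(p, 1):
        cur = [i]
        for j, b in enumerate(g, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (a != b)))
        prev = cur
    return 100.0 * (1.0 - prev[-1] / max(len(p), len(g)))


def f1_at_k(pred: Sequence[str], gt: Sequence[str], k: float) -> float:
    """Segmental F1: a predicted segment is a true positive if its IoU with a
    not-yet-matched ground-truth segment of the same label is at least k."""
    p_segs, g_segs = to_segments(pred), to_segments(gt)
    matched = [False] * len(g_segs)
    tp = 0
    for label, ps, pe in p_segs:
        best_iou, best_j = 0.0, -1
        for j, (gl, gs, ge) in enumerate(g_segs):
            if gl != label:
                continue
            inter = max(0, min(pe, ge) - max(ps, gs))
            # Assumption: "union" is the span covering both segments.
            union = max(pe, ge) - min(ps, gs)
            if inter / union > best_iou:
                best_iou, best_j = inter / union, j
        if best_j >= 0 and best_iou >= k and not matched[best_j]:
            matched[best_j] = True
            tp += 1
    precision = tp / len(p_segs)
    recall = tp / len(g_segs)
    if precision + recall == 0:
        return 0.0
    return 100.0 * 2 * precision * recall / (precision + recall)


# Hypothetical 10-frame example with one misplaced boundary.
gt = ["a"] * 4 + ["b"] * 4 + ["c"] * 2
pred = ["a"] * 3 + ["b"] * 5 + ["c"] * 2
print(frame_accuracy(pred, gt), edit_score(pred, gt), f1_at_k(pred, gt, 0.5))
```

In this example the boundary error costs one frame of accuracy, while the edit score and F1@50% are unaffected because the segment order is correct and every predicted segment overlaps its ground-truth counterpart well above the threshold.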