Action Segmentation

Action Segmentation is a challenging problem in high-level video understanding. In its simplest form, Action Segmentation aims to segment a temporally untrimmed video by time and label each segmented part with one of pre-defined action labels. The results of Action Segmentation can be further used as input to various applications, such as video-to-text and action localization.

Source: TricorNet: A Hybrid Temporal Convolutional and Recurrent Network for Video Action Segmentation

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 126–150 of 219 papers

Title	Date	Tasks	Status	Hype
Fast and Unsupervised Action Boundary Detection for Action Segmentation	Jan 1, 2022	Action SegmentationBoundary Detection	—Unverified	0
You Can Wash Hands Better: Accurate Daily Handwashing Assessment with a Smartwatch	Dec 9, 2021	Action SegmentationGesture Recognition	CodeCode Available	0
Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation	Dec 2, 2021	Action SegmentationRepresentation Learning	CodeCode Available	1
Towards Tokenized Human Dynamics Representation	Nov 22, 2021	Action SegmentationAction Understanding	CodeCode Available	1
Few-Shot Temporal Action Localization with Query Adaptive Transformer	Oct 20, 2021	Action LocalizationAction Segmentation	CodeCode Available	1
ASFormer: Transformer for Action Segmentation	Oct 16, 2021	Action SegmentationDecoder	CodeCode Available	1
Hierarchical Modeling for Task Recognition and Action Segmentation in Weakly-Labeled Instructional Videos	Oct 12, 2021	Action SegmentationSegmentation	CodeCode Available	0
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding	Sep 28, 2021	Action LocalizationAction Segmentation	—Unverified	0
Long Short View Feature Decomposition via Contrastive Video Representation Learning	Sep 23, 2021	Action RecognitionAction Segmentation	—Unverified	0
TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment	Aug 23, 2021	Action SegmentationContrastive Learning	—Unverified	0
Temporal Action Segmentation with High-level Complex Activity Labels	Aug 15, 2021	Action RecognitionAction Segmentation	—Unverified	0
FIFA: Fast Inference Approximation for Action Segmentation	Aug 9, 2021	Action SegmentationSegmentation	—Unverified	0
Unsupervised Action Segmentation for Instructional Videos	Jun 7, 2021	Action SegmentationSegmentation	—Unverified	0
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation	May 29, 2021	Action ParsingAction Segmentation	—Unverified	0
Unsupervised Action Segmentation by Joint Representation Learning and Online Clustering	May 27, 2021	Action SegmentationClustering	CodeCode Available	1
Coarse to Fine Multi-Resolution Temporal Convolutional Network	May 23, 2021	Action SegmentationDecoder	CodeCode Available	1
VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding	May 20, 2021	Action SegmentationLanguage Modeling	—Unverified	0
Efficient Two-Step Networks for Temporal Action Segmentation	Apr 30, 2021	Action SegmentationSegmentation	CodeCode Available	1
Unsupervised Discriminative Embedding for Sub-Action Learning in Complex Activities	Apr 30, 2021	Action RecognitionAction Segmentation	—Unverified	0
Action in Mind: A Neural Network Approach to Action Recognition and Segmentation	Apr 30, 2021	Action RecognitionAction Segmentation	—Unverified	0
Action Segmentation with Mixed Temporal Domain Adaptation	Apr 15, 2021	Action SegmentationDomain Adaptation	—Unverified	0
Anchor-Constrained Viterbi for Set-Supervised Action Segmentation	Apr 5, 2021	Action SegmentationSegmentation	—Unverified	0
Action Shuffle Alternating Learning for Unsupervised Action Segmentation	Apr 5, 2021	Action SegmentationSegmentation	—Unverified	0
Automated freezing of gait assessment with marker-based motion capture and multi-stage spatial-temporal graph convolutional neural networks	Mar 29, 2021	Action SegmentationSegmentation	CodeCode Available	1
Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation	Mar 20, 2021	Action SegmentationClustering	CodeCode Available	1

Show:10 25 50

← PrevPage 6 of 9Next →

All datasets Breakfast 50 Salads GTEA COIN Assembly101 JIGSAWS Youtube INRIA Instructional 50Salads MPII Cooking 2 Dataset

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AdaFocus (newly extracted I3D-features, LT-Context model)	Average F1	76.2	—	Unverified
2	FACT (efficient hybrid of convolution and transformer model)	Average F1	74.7	—	Unverified
3	ASQuery	Average F1	74.6	—	Unverified
4	BIT	Average F1	73.7	—	Unverified
5	DiffAct	Average F1	73.6	—	Unverified
6	BaFormer	Average F1	72.4	—	Unverified
7	CETNet	Average F1	71.8	—	Unverified
8	SF-TMN(ASFormer)	Average F1	71.6	—	Unverified
9	RF++-SSTDA	Acc	70.8	—	Unverified
10	ASPnet	Average F1	70.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Br-Prompt+ASPnet (RGB, flow, accelerometer)	F1@50%	88.5	—	Unverified
2	Semantic2Graph	F1@50%	87.3	—	Unverified
3	BaFormer	F1@50%	83.9	—	Unverified
4	DiffAct	F1@50%	83.7	—	Unverified
5	SF-TMN(ASFormer)	F1@50%	82.9	—	Unverified
6	LTContext	F1@50%	82	—	Unverified
7	UVAST	F1@50%	81.7	—	Unverified
8	Br-Prompt+ASFormer	F1@50%	81.3	—	Unverified
9	EUT	F1@50%	81	—	Unverified
10	CETNet	F1@50%	80.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Semantic2Graph	F1@50%	91.3	—	Unverified
2	FACT	F1@50%	87.5	—	Unverified
3	DiffAct	F1@50%	84.7	—	Unverified
4	BaFormer	F1@50%	83.5	—	Unverified
5	SF-TMN(ASFormer)	F1@50%	83.1	—	Unverified
6	Br-Prompt+ASFormer	F1@50%	83	—	Unverified
7	DPRN	F1@50%	82.9	—	Unverified
8	BIT	F1@50%	82.6	—	Unverified
9	CETNet	F1@50%	81.3	—	Unverified
10	UVAST	F1@50%	81	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UnLoc-L	Frame accuracy	72.8	—	Unverified
2	Univl	Frame accuracy	70	—	Unverified
3	Norton	Frame accuracy	69.8	—	Unverified
4	VideoClip	Frame accuracy	68.7	—	Unverified
5	TACo	Frame accuracy	68.4	—	Unverified
6	VLM	Frame accuracy	68.4	—	Unverified
7	MIL-NCE	Frame accuracy	61	—	Unverified
8	ActBERT	Frame accuracy	57	—	Unverified
9	CBT	Frame accuracy	53.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ASQuery	F1@10%	37.8	—	Unverified
2	LTContext	F1@10%	33.9	—	Unverified
3	ASFormer	F1@10%	33.4	—	Unverified
4	C2F-TCN	F1@10%	33.3	—	Unverified
5	UVAST	F1@10%	32.1	—	Unverified
6	MS-TCN++	F1@10%	31.6	—	Unverified
7	ProTAS(Offline)	F1@10%	28.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RL+Tree	Edit Distance	88.53	—	Unverified
2	RL (full)	Edit Distance	87.96	—	Unverified
3	TricorNet	Edit Distance	86.8	—	Unverified
4	SDL+SC-CRF	Edit Distance	86.21	—	Unverified
5	TCN	Edit Distance	83.1	—	Unverified
6	ST-CNN+Seg	Edit Distance	66.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TSA (FINCH)	Acc	62.4	—	Unverified
2	TSA (Kmeans)	Acc	59.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	EUT	Acc	87.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Unsup. TW-FINCH (K=avg/activity)	Accuracy	42	—	Unverified