Leveraging triplet loss for unsupervised action segmentation

2023-04-13

E. Bueno-Benito, B. Tura, M. Dimiccoli

Code

github.com/elenabbbuenob/tsa-actionseg
Official · In paper · PyTorch · ★ 10

Abstract

In this paper, we propose a novel fully unsupervised framework that learns action representations suitable for the action segmentation task from the single input video itself, without requiring any training data. Our method is a deep metric learning approach rooted in a shallow network with a triplet loss operating on similarity distributions and a novel triplet selection strategy that effectively models temporal and semantic priors to discover actions in the new representational space. Under these circumstances, we successfully recover temporal boundaries in the learned action representations with higher quality compared with existing unsupervised approaches. The proposed method is evaluated on two widely used benchmark datasets for the action segmentation task and it achieves competitive performance by applying a generic clustering algorithm on the learned representations.
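To make the idea of a triplet loss driven by a temporal prior concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the abstract describes a loss operating on similarity distributions and a selection strategy that also uses semantic priors, whereas this sketch uses the standard margin-based triplet loss on Euclidean distances and a purely temporal selection rule. The function names `temporal_triplets` and `triplet_margin_loss`, and the parameters `pos_window`, `neg_gap`, and `margin`, are hypothetical and chosen only for illustration.

```python
import math


def temporal_triplets(n_frames, pos_window=2, neg_gap=10):
    """Build (anchor, positive, negative) frame-index triplets.

    Assumed rule (illustrative only): the positive lies pos_window frames
    from the anchor, the negative lies neg_gap frames from it. Offsets go
    forward in time when possible, otherwise backward.
    """
    triplets = []
    for a in range(n_frames):
        p = a + pos_window if a + pos_window < n_frames else a - pos_window
        n = a + neg_gap if a + neg_gap < n_frames else a - neg_gap
        # Skip anchors for which no valid positive or negative exists.
        if 0 <= p < n_frames and 0 <= n < n_frames:
            triplets.append((a, p, n))
    return triplets


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def triplet_margin_loss(features, triplets, margin=1.0):
    """Mean of max(0, d(a, p) - d(a, n) + margin) over all triplets."""
    losses = [
        max(0.0, euclidean(features[a], features[p])
            - euclidean(features[a], features[n]) + margin)
        for a, p, n in triplets
    ]
    return sum(losses) / len(losses) if losses else 0.0


if __name__ == "__main__":
    # Toy video: two "actions" of 10 frames each with well-separated features.
    feats = [[0.0, 0.0]] * 10 + [[5.0, 0.0]] * 10
    trips = temporal_triplets(len(feats), pos_window=1, neg_gap=10)
    print(triplet_margin_loss(feats, trips, margin=1.0))
```

In a full pipeline, a loss of this kind would be minimized over the parameters of an embedding network, and the resulting frame embeddings would then be grouped with a generic clustering algorithm, as the abstract describes.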

Tasks

Action Segmentation, Clustering, Metric Learning, Segmentation, Triplet, Unsupervised Action Segmentation, Video Understanding

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Breakfast	TSA (FINCH)	Acc	65.1	—	Unverified
Breakfast	TSA (Kmeans)	Acc	63.7	—	Unverified
Breakfast	TSA (Spectral)	Acc	63.2	—	Unverified
Youtube INRIA Instructional	TSA (FINCH)	Acc	62.4	—	Unverified
Youtube INRIA Instructional	TSA (Kmeans)	Acc	59.7	—	Unverified
