SOTAVerified

Latent Space Semi-Supervised Time Series Data Clustering

2021-01-01Unverified0· sign in to hype

Andrew Hill, Katerina Kechris, Russell Bowler, Farnoush Kashani

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Time series data is abundantly available in the real world, but there is a distinct lack of large, labeled datasets available for many types of learning tasks. Semi-supervised models, which can leverage small amounts of expert-labeled data along with a larger unlabeled dataset, have been shown to improve performance over unsupervised learning models. Existing semi-supervised time series clustering algorithms suffer from lack of scalability as they are limited to perform learning operations within the original data space. We propose an autoencoder-based semi-supervised learning model along with multiple semi-supervised objective functions which can be used to improve the quality of the autoencoder’s learned latent space via the addition of a small number of labeled examples. Experiments show that our methods can consistently improve k-Means clustering performance on a variety of datasets. Our methods achieve a maximum average ARI of 0.897, a 140% increase over an unsupervised CAE model. Our methods also achieve a maximum improvement of 44% over a semi-supervised model.

Tasks

Reproductions