SOTAVerified

Learn The Big Picture: Representation Learning for Clustering

2021-08-01ACL (RepL4NLP) 2021Code Available0· sign in to hype

Sumanta Kashyapi, Laura Dietz

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Existing supervised models for text clustering find it difficult to directly optimize for clustering results. This is because clustering is a discrete process and it is difficult to estimate meaningful gradient of any discrete function that can drive gradient based optimization algorithms. So, existing supervised clustering algorithms indirectly optimize for some continuous function that approximates the clustering process. We propose a scalable training strategy that directly optimizes for a discrete clustering metric. We train a BERT-based embedding model using our method and evaluate it on two publicly available datasets. We show that our method outperforms another BERT-based embedding model employing Triplet loss and other unsupervised baselines. This suggests that optimizing directly for the clustering outcome indeed yields better representations suitable for clustering.

Tasks

Reproductions