Categorical Unsupervised Variational Acoustic Clustering

2025-04-10

Luan Vinícius Fiorio, Ivana Nikoloska, Ronald M. Aarts

Abstract

We propose a categorical approach for unsupervised variational acoustic clustering of audio data in the time-frequency domain. The consideration of a categorical distribution enforces sharper clustering even when data points strongly overlap in time and frequency, which is the case for most datasets of urban acoustic scenes. To this end, we use a Gumbel-Softmax distribution as a soft approximation to the categorical distribution, allowing for training via backpropagation. In this setting, the softmax temperature serves as the main mechanism to tune clustering performance. The results show that the proposed model can obtain impressive clustering performance for all considered datasets, even when data points strongly overlap in time and frequency.
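
The abstract does not give implementation details, so the following is only a minimal sketch of the standard Gumbel-Softmax relaxation it refers to, not the authors' model. The function name `gumbel_softmax_sample`, the NumPy setting, and the uniform cluster logits are assumptions made for illustration. It shows how the temperature controls how close a relaxed sample is to a one-hot cluster assignment.

```python
import numpy as np

def gumbel_softmax_sample(logits, temperature, rng):
    """Draw one relaxed (soft) one-hot sample per row of `logits`.

    Hypothetical helper for illustration; not taken from the paper.
    """
    # Perturb the logits with Gumbel(0, 1) noise (the Gumbel-max trick).
    gumbel_noise = rng.gumbel(loc=0.0, scale=1.0, size=logits.shape)
    scaled = (logits + gumbel_noise) / temperature
    # Numerically stable softmax over the last (cluster) axis.
    scaled = scaled - scaled.max(axis=-1, keepdims=True)
    exp_scaled = np.exp(scaled)
    return exp_scaled / exp_scaled.sum(axis=-1, keepdims=True)

rng = np.random.default_rng(0)
logits = np.zeros((5, 4))  # assumed: 5 data points, 4 equally likely clusters

sharp = gumbel_softmax_sample(logits, temperature=0.1, rng=rng)
smooth = gumbel_softmax_sample(logits, temperature=10.0, rng=rng)

# Lower temperatures push each row toward a one-hot vector;
# higher temperatures push each row toward a uniform vector.
print(sharp.round(3))
print(smooth.round(3))
```

In a trainable model the same computation would be written in an automatic differentiation framework so that gradients flow through the relaxed sample; the NumPy version above only illustrates the sampling step and the effect of the temperature.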