Learning Disentangled Audio Representations through Controlled Synthesis
2024-02-16Unverified0· sign in to hype
Yusuf Brima, Ulf Krumnack, Simone Pika, Gunther Heidemann
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
This paper tackles the scarcity of benchmarking data in disentangled auditory representation learning. We introduce SynTone, a synthetic dataset with explicit ground truth explanatory factors for evaluating disentanglement techniques. Benchmarking state-of-the-art methods on SynTone highlights its utility for method evaluation. Our results underscore strengths and limitations in audio disentanglement, motivating future research.