Meta-learning-based percussion transcription and tala identification from low-resource audio
Rahul Bapusaheb Kodag, Vipul Arora
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
This study introduces a meta-learning-based approach for low-resource Tabla Stroke Transcription (TST) and tala identification in Hindustani classical music. Using Model-Agnostic Meta-Learning (MAML), we address the challenges of limited annotated datasets and label heterogeneity, enabling rapid adaptation to new tasks with minimal data. The method is validated across various datasets, including tabla solo and concert recordings, demonstrating robustness in polyphonic audio scenarios. We propose two novel tala identification techniques based on stroke sequences and rhythmic patterns. Additionally, the approach proves effective for Automatic Drum Transcription (ADT), showcasing its flexibility for Indian and Western percussion music. Experimental results show that the proposed method outperforms existing techniques in low-resource settings, significantly contributing to music transcription and studying musical traditions through computational tools.