FMA: A Dataset For Music Analysis

2016-12-06ISMIR 2017Code Available1· sign in to hype

Michaël Defferrard, Kirell Benzi, Pierre Vandergheynst, Xavier Bresson

Code Available — Be the first to reproduce this paper.

Code

github.com/mdeff/fma
OfficialIn papertf★ 0
github.com/microsoft/fadtk
pytorch★ 251
github.com/darius522/dnr-utils
none★ 73
github.com/MorenoLaQuatra/ARCH
pytorch★ 54
github.com/cocktail-fork/cocktail-fork.github.io
none★ 5
github.com/dcase2024-task7-sound-scene-synthesis/fadtk
pytorch★ 4
github.com/Manmayi/Music-Data-Visualization
none★ 1
github.com/karn1986/fma_pytorch
pytorch★ 0
github.com/KrishnaManmayi/Music-Data-Visualization
none★ 0
github.com/markcutajar/raw-music-tagging-cnns
tf★ 0

Abstract

We introduce the Free Music Archive (FMA), an open and easily accessible dataset suitable for evaluating several tasks in MIR, a field concerned with browsing, searching, and organizing large music collections. The community's growing interest in feature and end-to-end learning is however restrained by the limited availability of large audio datasets. The FMA aims to overcome this hurdle by providing 917 GiB and 343 days of Creative Commons-licensed audio from 106,574 tracks from 16,341 artists and 14,854 albums, arranged in a hierarchical taxonomy of 161 genres. It provides full-length and high-quality audio, pre-computed features, together with track- and user-level metadata, tags, and free-form text such as biographies. We here describe the dataset and how it was created, propose a train/validation/test split and three subsets, discuss some suitable MIR tasks, and evaluate some baselines for genre recognition. Code, data, and usage examples are available at https://github.com/mdeff/fma

FMA: A Dataset For Music Analysis

Code

Abstract

Reproductions