SOTAVerified

Music Transcription

Music transcription is the task of converting an acoustic musical signal into some form of music notation.

(Image credit: ISMIR 2015 Tutorial - Automatic Music Transcription)

Papers

Showing 1-25 of 96 papers

Title | Status | Hype
A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation | Code | 4
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation | Code | 3
Scoring Time Intervals using Non-Hierarchical Transformer For Automatic Piano Transcription | Code | 3
MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage | Code | 2
Cross-domain Neural Pitch and Periodicity Estimation | Code | 2
M4Singer: a Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus | Code | 2
Omnizart: A General Toolbox for Automatic Music Transcription | Code | 2
D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription | Code | 1
Source Separation & Automatic Transcription for Music | Code | 1
End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding | Code | 1
Automatic Piano Transcription with Hierarchical Frequency-Time Transformer | Code | 1
FretNet: Continuous-Valued Pitch Contour Streaming for Polyphonic Guitar Tablature Transcription | Code | 1
The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling | Code | 1
Unaligned Supervision For Automatic Music Transcription in The Wild | Code | 1
Skipping the Frame-Level: Event-Based Piano Transcription With Neural Semi-CRFs | Code | 1
MT3: Multi-Task Multitrack Music Transcription | Code | 1
A Unified Model for Zero-shot Music Source Separation, Transcription and Synthesis | Code | 1
Sequence-to-Sequence Piano Transcription with Transformers | Code | 1
The Effect of Spectrogram Reconstruction on Automatic Music Transcription: An Alternative Approach to Improve Transcription Accuracy | Code | 1
The MIDI Degradation Toolkit: Symbolic Music Augmentation and Correction | Code | 1
Residual Shuffle-Exchange Networks for Fast Processing of Long Sequences | Code | 1
Bayesian Sparsification of Deep C-valued Networks | Code | 1
A holistic approach to polyphonic music transcription with neural networks | Code | 1
Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset | Code | 1
Onsets and Frames: Dual-Objective Piano Transcription | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Transkun V2 (SemiCRF) | Onset F1 | 98.32 | - | Unverified
2 | hFT-Transformer | Onset F1 | 97.44 | - | Unverified
3 | Kim et al. | Onset F1 | 97.23 | - | Unverified
4 | YourMT3+ (YPTF.MoE+M) noPS | Onset F1 | 96.98 | - | Unverified
5 | Kong et al. | Onset F1 | 96.72 | - | Unverified
6 | YourMT3+ (YPTF.MoE+M) | Onset F1 | 96.52 | - | Unverified
7 | Semi-CRF | Onset F1 | 96.11 | - | Unverified
8 | MT3 (single dataset) | Onset F1 | 88 | - | Unverified
9 | MT3 (multi dataset) | Onset F1 | 86 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Transkun V2 (SemiCRF) with Data Augmentation | Onset F1 | 90.38 | - | Unverified
2 | YourMT3+ (YPTF.MoE+M, unseen) noPS | Onset F1 | 88.73 | - | Unverified
3 | Edwards et al. | Onset F1 | 88.4 | - | Unverified
4 | YourMT3+ (YPTF+S, unseen) | Onset F1 | 88.37 | - | Unverified
5 | Transkun V2 (SemiCRF) | Onset F1 | 86.1 | - | Unverified
6 | hFT-Transformer | Onset F1 | 85.14 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Residual Shuffle-Exchange network | APS | 78.02 | - | Unverified
2 | Complex Transformer | APS | 74.22 | - | Unverified
3 | Deep Complex Network | APS | 72.9 | - | Unverified
4 | Concatenated Transformer | APS | 71.3 | - | Unverified
5 | Deep Real Network | APS | 69.6 | - | Unverified
6 | CNN (64 stride) | APS | 67.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | YourMT3+ (YPTF.MoE+M) | note-level F-measure-no-offset (Fno) | 0.85 | - | Unverified
2 | PerceiverTF | note-level F-measure-no-offset (Fno) | 0.82 | - | Unverified
3 | MT3 (colab) | note-level F-measure-no-offset (Fno) | 0.75 | - | Unverified
4 | Jointist | note-level F-measure-no-offset (Fno) | 0.6 | - | Unverified
5 | MT3 | note-level F-measure-no-offset (Fno) | 0.57 | - | Unverified
6 | Basic Pitch | note-level F-measure-no-offset (Fno) | 0.43 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | YourMT3+ (YPTF.MoE+M) | Onset F1 | 81.79 | - | Unverified
2 | MT3 | Onset F1 | 77 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Transkun V2 (SemiCRF) with Data Augmentation | Onset F1 | 98.71 | - | Unverified
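
Most of the tables above rank models by an onset-only note F1 ("Onset F1", or "note-level F-measure-no-offset"). As a rough illustration of what that metric measures, here is a minimal Python sketch. It assumes a 50 ms onset tolerance and exact MIDI pitch matching, which is a common convention but is not confirmed by this page for any particular benchmark. The function name `onset_f1` and the `(onset_seconds, midi_pitch)` note format are hypothetical, and the greedy matching is a simplification: standard evaluation toolkits typically use an optimal one-to-one matching instead.

```python
from typing import List, Tuple

# Hypothetical note format for this sketch: (onset time in seconds, MIDI pitch).
Note = Tuple[float, int]


def onset_f1(reference: List[Note], estimated: List[Note],
             tolerance: float = 0.05) -> float:
    """Onset-only note F1 as a fraction in [0, 1].

    An estimated note counts as a hit if an unmatched reference note has the
    same pitch and an onset within `tolerance` seconds (assumed 50 ms here).
    Note offsets are ignored, and each reference note is matched at most once.
    """
    unmatched = list(reference)
    hits = 0
    for onset, pitch in sorted(estimated):
        # Greedy simplification: pick the closest still-unmatched reference note.
        best = None
        for i, (ref_onset, ref_pitch) in enumerate(unmatched):
            if ref_pitch != pitch or abs(ref_onset - onset) > tolerance:
                continue
            if best is None or abs(ref_onset - onset) < abs(unmatched[best][0] - onset):
                best = i
        if best is not None:
            unmatched.pop(best)
            hits += 1
    if hits == 0:
        return 0.0
    precision = hits / len(estimated)
    recall = hits / len(reference)
    return 2 * precision * recall / (precision + recall)


# Toy example: two of four estimated notes match two of three reference notes.
reference_notes = [(0.00, 60), (0.50, 64), (1.00, 67)]
estimated_notes = [(0.01, 60), (0.52, 64), (1.20, 67), (1.50, 72)]
score = onset_f1(reference_notes, estimated_notes)  # precision 0.5, recall 2/3, F1 = 4/7
```

The sketch returns a fraction, so a value such as 0.85 corresponds to the 0-1 scale used in one table above, while the tables that list values such as 98.32 report the same kind of score as a percentage.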