SOTAVerified

Music Transcription

Music transcription is the task of converting an acoustic musical signal into some form of music notation.

( Image credit: ISMIR 2015 Tutorial - Automatic Music Transcription )

Papers

Showing 150 of 96 papers

TitleStatusHype
Fretting-Transformer: Encoder-Decoder Model for MIDI to Tablature Transcription0
Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System0
Unified Cross-modal Translation of Score Images, Symbolic Music, and Performance Audio0
Automatic Music Transcription using Convolutional Neural Networks and Constant-Q transform0
Music Tempo Estimation on Solo Instrumental Performance0
Scalable Approximate Algorithms for Optimal Transport Linear ModelsCode0
Multi-task learning-based temporal pattern matching network for guitar tablature transcription0
D3RM: A Discrete Denoising Diffusion Refinement Model for Piano TranscriptionCode1
Meta-learning-based percussion transcription and tala identification from low-resource audio0
Piano Transcription by Hierarchical Language Modeling with Pretrained Roll-based Encoders0
Tuning Music Education: AI-Powered Personalization in Learning Music0
Source Separation & Automatic Transcription for MusicCode1
A Transformer-Based Visual Piano Transcription Algorithm0
Just Label the Repeats for In-The-Wild Audio-to-Score AlignmentCode0
Music Foundation Model as Generic Booster for Music Downstream Tasks0
AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model0
Development of Large Annotated Music Datasets using HMM-based Forced Viterbi Alignment0
Quantifying the Corpus Bias Problem in Automatic Music Transcription SystemsCode0
Polyphonic Piano Music Transcription System Exploiting Mutual Correlations of Different Musical Note States0
YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem AugmentationCode3
Machine Learning Techniques in Automatic Music Transcription: A Systematic Survey0
End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical DecodingCode1
Scoring Time Intervals using Non-Hierarchical Transformer For Automatic Piano TranscriptionCode3
MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument LeakageCode2
High Resolution Guitar Transcription via Domain Adaptation0
Engraving Oriented Joint Estimation of Pitch Spelling and Local and Global Keys0
A Data-Driven Analysis of Robust Automatic Piano Transcription0
Annotation-free Automatic Music Transcription with Scalable Synthetic Data and Adversarial Domain Confusion0
Improving Drumming Robot Via Attention Transformer Network0
Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription0
AIoT-Based Drum Transcription Robot using Convolutional Neural Networks0
Multi-modal Multi-view Clustering based on Non-negative Matrix Factorization0
Automatic Piano Transcription with Hierarchical Frequency-Time TransformerCode1
Multitrack Music Transcription with a Time-Frequency Perceiver0
Transfer of knowledge among instruments in automatic music transcription0
A Phoneme-Informed Neural Network Model for Note-Level Singing Transcription0
From Audio to Symbolic Encoding0
Cross-domain Neural Pitch and Periodicity EstimationCode2
M4Singer: a Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing CorpusCode2
FretNet: Continuous-Valued Pitch Contour Streaming for Polyphonic Guitar Tablature TranscriptionCode1
DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability0
The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative ModelingCode1
Jointist: Joint Learning for Multi-instrument Transcription and Its Applications0
Unaligned Supervision For Automatic Music Transcription in The WildCode1
Exploring Transformer's potential on automatic piano transcription0
Late multimodal fusion for image and audio music transcription0
Acoustics-specific Piano Velocity EstimationCode0
A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch EstimationCode4
A Perceptual Measure for Evaluating the Resynthesis of Automatic Music TranscriptionsCode0
Deep-Learning Architectures for Multi-Pitch Estimation: Towards Reliable Evaluation0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Transkun V2 (SemiCRF)Onset F198.32Unverified
2hFT-TransformerOnset F197.44Unverified
3Kim et al.Onset F197.23Unverified
4YourMT3+ (YPTF.MoE+M) noPSOnset F196.98Unverified
5Kong et al.Onset F196.72Unverified
6YourMT3+ (YPTF.MoE+M)Onset F196.52Unverified
7Semi-CRFOnset F196.11Unverified
8MT3 (single dataset)Onset F188Unverified
9MT3 (multi dataset)Onset F186Unverified
#ModelMetricClaimedVerifiedStatus
1Transkun V2 (SemiCRF) with Data AugmentationOnset F190.38Unverified
2YourMT3+ (YPTF.MoE+M, unseen) noPSOnset F188.73Unverified
3Edwards et al.Onset F188.4Unverified
4YourMT3+ (YPTF+S, unseen)Onset F188.37Unverified
5Transkun V2 (SemiCRF)Onset F186.1Unverified
6hFT-TransformerOnset F185.14Unverified
#ModelMetricClaimedVerifiedStatus
1Residual Shuffle-Exchange networkAPS78.02Unverified
2Complex TransformerAPS74.22Unverified
3Deep Complex NetworkAPS72.9Unverified
4Concatenated TransformerAPS71.3Unverified
5Deep Real NetworkAPS69.6Unverified
6CNN (64 stride)APS67.8Unverified
#ModelMetricClaimedVerifiedStatus
1YourMT3+ (YPTF.MoE+M)note-level F-measure-no-offset (Fno)0.85Unverified
2PerceiverTFnote-level F-measure-no-offset (Fno)0.82Unverified
3MT3 (colab)note-level F-measure-no-offset (Fno)0.75Unverified
4Jointistnote-level F-measure-no-offset (Fno)0.6Unverified
5MT3note-level F-measure-no-offset (Fno)0.57Unverified
6Basic Pitchnote-level F-measure-no-offset (Fno)0.43Unverified
#ModelMetricClaimedVerifiedStatus
1YourMT3+ (YPTF.MoE+M)Onset F181.79Unverified
2MT3Onset F177Unverified
#ModelMetricClaimedVerifiedStatus
1Transkun V2 (SemiCRF) with Data AugmentationOnset F198.71Unverified