| FLUX that Plays Music | Sep 1, 2024 | Music Generation, Text-to-Music Generation | Code Available | 13 | 5 |
| YuE: Scaling Open Foundation Models for Long-Form Music Generation | Mar 11, 2025 | Form, In-Context Learning | Code Available | 9 | 5 |
| Long-form music generation with latent diffusion | Apr 16, 2024 | Audio Generation, Form | Code Available | 7 | 5 |
| DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion | Mar 3, 2025 | Music Generation | Code Available | 7 | 5 |
| Simple and Controllable Music Generation | Jun 8, 2023 | Language Modeling, Language Modelling | Code Available | 6 | 5 |
| MusicLM: Generating Music From Text | Jan 26, 2023 | Music Generation, Text-to-Music Generation | Code Available | 6 | 5 |
| LeVo: High-Quality Song Generation with Multi-Preference Alignment | Jun 9, 2025 | Instruction Following, Music Generation | Code Available | 5 | 5 |
| NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Feb 25, 2025 | Language Modeling, Language Modelling | Code Available | 5 | 5 |
| InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation | Feb 28, 2025 | Audio Generation, Form | Code Available | 5 | 5 |
| Quality-aware Masked Diffusion Transformer for Enhanced Music Generation | May 24, 2024 | Diversity, Music Generation | Code Available | 4 | 5 |
| SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement | Jun 9, 2025 | Music Generation | Code Available | 4 | 5 |
| Moûsai: Text-to-Music Generation with Long-Context Latent Diffusion | Jan 27, 2023 | GPU, Image Generation | Code Available | 4 | 5 |
| Vision-to-Music Generation: A Survey | Mar 27, 2025 | multimodal generation, Music Generation | Code Available | 3 | 5 |
| COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations | Apr 25, 2024 | Contrastive Learning, Music Generation | Code Available | 3 | 5 |
| Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance | Sep 23, 2024 | Emotion Recognition, FAD | Code Available | 3 | 5 |
| BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Jun 9, 2022 | Audio Generation, Audio Synthesis | Code Available | 3 | 5 |
| Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional Representation | Jul 30, 2024 | Disentanglement, Music Generation | Code Available | 2 | 5 |
| Symphony Generation with Permutation Invariant Language Model | May 10, 2022 | Audio Generation, Decoder | Code Available | 2 | 5 |
| Diff-BGM: A Diffusion Model for Video Background Music Generation | May 20, 2024 | Diversity, Music Generation | Code Available | 2 | 5 |
| ETTA: Elucidating the Design Space of Text-to-Audio Models | Dec 26, 2024 | AudioCaps, Audio captioning | Code Available | 2 | 5 |
| VampNet: Music Generation via Masked Acoustic Token Modeling | Jul 10, 2023 | Music Compression, Music Generation | Code Available | 2 | 5 |
| Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion | Feb 22, 2024 | Music Generation | Code Available | 2 | 5 |
| VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling | Jun 6, 2024 | Diversity, Music Generation | Code Available | 2 | 5 |
| Predicting Human Brain States with Transformer | Dec 11, 2024 | Language Modelling, Music Generation | Code Available | 2 | 5 |
| Convolutional Generative Adversarial Networks with Binary Neurons for Polyphonic Music Generation | Apr 25, 2018 | Music Generation | Code Available | 2 | 5 |
| TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument | Feb 13, 2025 | Audio Generation, Decoder | Code Available | 2 | 5 |
| PAM: Prompting Audio-Language Models for Audio Quality Assessment | Feb 1, 2024 | Audio Quality Assessment, Music Generation | Code Available | 2 | 5 |
| MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation | Jul 21, 2024 | Diversity, Music Generation | Code Available | 2 | 5 |
| MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment | Sep 19, 2017 | Music Generation | Code Available | 2 | 5 |
| Musika! Fast Infinite Waveform Music Generation | Aug 18, 2022 | CPU, Generative Adversarial Network | Code Available | 2 | 5 |
| Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges | Apr 24, 2024 | Drug Design, Inductive Bias | Code Available | 2 | 5 |
| Melody-Guided Music Generation | Sep 30, 2024 | cross-modal alignment, Music Generation | Code Available | 2 | 5 |
| Moonbeam: A MIDI Foundation Model Using Both Absolute and Relative Music Attributes | May 21, 2025 | Music Classification, Music Generation | Code Available | 2 | 5 |
| Mustango: Toward Controllable Text-to-Music Generation | Nov 14, 2023 | Data Augmentation, Denoising | Code Available | 2 | 5 |
| Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models | May 16, 2024 | Music Generation | Code Available | 2 | 5 |
| Multi-instrument Music Synthesis with Spectrogram Diffusion | Jun 11, 2022 | Decoder, Generative Adversarial Network | Code Available | 2 | 5 |
| Anticipatory Music Transformer | Jun 14, 2023 | Music Generation | Code Available | 1 | 5 |
| Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset | Oct 29, 2018 | Music Generation, Music Modeling | Code Available | 1 | 5 |
| Is Disentanglement enough? On Latent Representations for Controllable Music Generation | Aug 1, 2021 | Decoder, Disentanglement | Code Available | 1 | 5 |
| Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System | Apr 5, 2020 | Music Generation | Code Available | 1 | 5 |
| Investigating Personalization Methods in Text to Music Generation | Sep 20, 2023 | Data Augmentation, Music Generation | Code Available | 1 | 5 |
| It's Raw! Audio Generation with State-Space Models | Feb 20, 2022 | Audio Generation, Density Estimation | Code Available | 1 | 5 |
| Do Music Generation Models Encode Music Theory? | Oct 1, 2024 | Emotion Recognition, Genre classification | Code Available | 1 | 5 |
| Discrete Diffusion Probabilistic Models for Symbolic Music Generation | May 16, 2023 | Denoising, Music Generation | Code Available | 1 | 5 |
| ImprovNet -- Generating Controllable Musical Improvisations with Iterative Corruption Refinement | Feb 6, 2025 | Music Generation, Rhythm | Code Available | 1 | 5 |
| Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation | Jun 15, 2022 | Contrastive Learning, Denoising | Code Available | 1 | 5 |
| DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models | Jul 30, 2021 | Decoder, Genre classification | Code Available | 1 | 5 |
| Hierarchical Timbre-Painting and Articulation Generation | Aug 30, 2020 | Music Generation | Code Available | 1 | 5 |
| Incorporating Music Knowledge in Continual Dataset Augmentation for Music Generation | Jun 23, 2020 | Music Generation | Code Available | 1 | 5 |
| JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation | Oct 29, 2023 | Music Generation | Code Available | 1 | 5 |