| Quality-aware Masked Diffusion Transformer for Enhanced Music Generation | May 24, 2024 | DiversityMusic Generation | CodeCode Available | 4 |
| SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement | Jun 9, 2025 | Music Generation | CodeCode Available | 4 |
| Vision-to-Music Generation: A Survey | Mar 27, 2025 | multimodal generationMusic Generation | CodeCode Available | 3 |
| BigVGAN: A Universal Neural Vocoder with Large-Scale Training | Jun 9, 2022 | Audio GenerationAudio Synthesis | CodeCode Available | 3 |
| Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance | Sep 23, 2024 | Emotion RecognitionFAD | CodeCode Available | 3 |
| COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations | Apr 25, 2024 | Contrastive LearningMusic Generation | CodeCode Available | 3 |
| MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment | Sep 19, 2017 | Music Generation | CodeCode Available | 2 |
| Multi-instrument Music Synthesis with Spectrogram Diffusion | Jun 11, 2022 | DecoderGenerative Adversarial Network | CodeCode Available | 2 |
| Moonbeam: A MIDI Foundation Model Using Both Absolute and Relative Music Attributes | May 21, 2025 | Music ClassificationMusic Generation | CodeCode Available | 2 |
| Melody-Guided Music Generation | Sep 30, 2024 | cross-modal alignmentMusic Generation | CodeCode Available | 2 |