| FLUX that Plays Music | Sep 1, 2024 | Music GenerationText-to-Music Generation | CodeCode Available | 13 | 5 |
| YuE: Scaling Open Foundation Models for Long-Form Music Generation | Mar 11, 2025 | FormIn-Context Learning | CodeCode Available | 9 | 5 |
| DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion | Mar 3, 2025 | Music Generation | CodeCode Available | 7 | 5 |
| Long-form music generation with latent diffusion | Apr 16, 2024 | Audio GenerationForm | CodeCode Available | 7 | 5 |
| MusicLM: Generating Music From Text | Jan 26, 2023 | Music GenerationText-to-Music Generation | CodeCode Available | 6 | 5 |
| Simple and Controllable Music Generation | Jun 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 6 | 5 |
| LeVo: High-Quality Song Generation with Multi-Preference Alignment | Jun 9, 2025 | Instruction FollowingMusic Generation | CodeCode Available | 5 | 5 |
| InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation | Feb 28, 2025 | Audio GenerationForm | CodeCode Available | 5 | 5 |
| NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Feb 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 5 | 5 |
| Quality-aware Masked Diffusion Transformer for Enhanced Music Generation | May 24, 2024 | DiversityMusic Generation | CodeCode Available | 4 | 5 |