| FLUX that Plays Music | Sep 1, 2024 | Music Generation, Text-to-Music Generation | Code Available | 13 |
| Combining audio control and style transfer using latent diffusion | Jul 31, 2024 | Disentanglement, Music Generation | Unverified | 0 |
| MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation | Jul 21, 2024 | Diversity, Music Generation | Code Available | 2 |
| Stable Audio Open | Jul 19, 2024 | Audio Generation, Text-to-Music Generation | Code Available | 7 |
| The Interpretation Gap in Text-to-Music Generation Models | Jul 14, 2024 | Information Retrieval, Music Generation | Unverified | 0 |
| Improving Text-To-Audio Models with Synthetic Captions | Jun 18, 2024 | AudioCaps, Audio captioning | Code Available | 5 |
| JEN-1 DreamStyler: Customized Musical Concept Learning via Pivotal Parameters Tuning | Jun 18, 2024 | Music Generation, Text-to-Music Generation | Unverified | 0 |
| MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models | Jun 7, 2024 | FAD, Text-to-Music Generation | Code Available | 2 |
| Quality-aware Masked Diffusion Transformer for Enhanced Music Generation | May 24, 2024 | Diversity, Music Generation | Code Available | 4 |
| MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models | Feb 9, 2024 | Music Generation, Text-to-Music Generation | Code Available | 1 |