| MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners | Jun 23, 2025 | AttributeAudio inpainting | —Unverified | 0 |
| Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models | Jun 18, 2025 | Music GenerationText-to-Music Generation | —Unverified | 0 |
| Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation | Jun 10, 2025 | Audio inpaintingMusic Generation | —Unverified | 0 |
| TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument | Feb 13, 2025 | Audio GenerationDecoder | CodeCode Available | 2 |
| Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jan 24, 2025 | FADLanguage Modeling | —Unverified | 0 |
| ETTA: Elucidating the Design Space of Text-to-Audio Models | Dec 26, 2024 | AudioCapsAudio captioning | CodeCode Available | 2 |
| Long-Form Text-to-Music Generation with Adaptive Prompts: A Case Study in Tabletop Role-Playing Games Soundtracks | Nov 6, 2024 | FormMusic Generation | CodeCode Available | 0 |
| MusicFlow: Cascaded Flow Matching for Text Guided Music Generation | Oct 27, 2024 | Music GenerationText-to-Music Generation | —Unverified | 0 |
| Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer | Oct 7, 2024 | Music GenerationMusic Style Transfer | —Unverified | 0 |
| Melody-Guided Music Generation | Sep 30, 2024 | cross-modal alignmentMusic Generation | CodeCode Available | 2 |