| Not that Groove: Zero-Shot Symbolic Music Editing | May 13, 2025 | Music Generation | —Unverified | 0 |
| Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation | May 6, 2025 | Image GenerationMamba | CodeCode Available | 1 |
| From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems | Apr 30, 2025 | Music Generation | —Unverified | 0 |
| Extending Visual Dynamics for Video-to-Music Generation | Apr 10, 2025 | Music GenerationOptical Flow Estimation | —Unverified | 0 |
| Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation | Apr 7, 2025 | AllMusic Generation | —Unverified | 0 |
| LoopGen: Training-Free Loopable Music Generation | Apr 6, 2025 | Music Generation | CodeCode Available | 1 |
| Deep learning for music generation. Four approaches and their comparative evaluation | Apr 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives | Apr 1, 2025 | Music Generation | —Unverified | 0 |
| Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model | Mar 28, 2025 | Music Generation | —Unverified | 0 |
| Vision-to-Music Generation: A Survey | Mar 27, 2025 | multimodal generationMusic Generation | CodeCode Available | 3 |
| Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation | Mar 25, 2025 | Music Generation | —Unverified | 0 |
| Towards Responsible AI Music: an Investigation of Trustworthy Features for Creative Systems | Mar 24, 2025 | EthicsFairness | —Unverified | 0 |
| AudioX: Diffusion Transformer for Anything-to-Audio Generation | Mar 13, 2025 | Audio GenerationMusic Generation | —Unverified | 0 |
| YuE: Scaling Open Foundation Models for Long-Form Music Generation | Mar 11, 2025 | FormIn-Context Learning | CodeCode Available | 9 |
| FilmComposer: LLM-Driven Music Production for Silent Film Clips | Mar 11, 2025 | Music GenerationRhythm | —Unverified | 0 |
| A Multimodal Symphony: Integrating Taste and Sound through Generative AI | Mar 4, 2025 | Music Generation | CodeCode Available | 0 |
| DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion | Mar 3, 2025 | Music Generation | CodeCode Available | 7 |
| InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation | Feb 28, 2025 | Audio GenerationForm | CodeCode Available | 5 |
| NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Feb 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| A Comprehensive Survey on Generative AI for Video-to-Music Generation | Feb 18, 2025 | Music Generation | —Unverified | 0 |
| Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models | Feb 17, 2025 | Music Generation | —Unverified | 0 |
| F-StrIPE: Fast Structure-Informed Positional Encoding for Symbolic Music Generation | Feb 14, 2025 | Music Generation | —Unverified | 0 |
| Video Soundtrack Generation by Aligning Emotions and Temporal Boundaries | Feb 14, 2025 | Music Generation | —Unverified | 0 |
| TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument | Feb 13, 2025 | Audio GenerationDecoder | CodeCode Available | 2 |
| YNote: A Novel Music Notation for Fine-Tuning LLMs in Music Generation | Feb 12, 2025 | Music Generation | —Unverified | 0 |