| WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph Modeling | Jul 14, 2025 | Music Generation | Code Available | 1 |
| MusiScene: Leveraging MU-LLaMA for Scene Imagination and Enhanced Video Background Music Generation | Jul 8, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| TOMI: Transforming and Organizing Music Ideas for Multi-Track Compositions with Full-Song Structure | Jun 29, 2025 | Music Generation | Code Available | 1 |
| Exploring Adapter Design Tradeoffs for Low Resource Music Generation | Jun 26, 2025 | Music Generation, parameter-efficient fine-tuning | Unverified | 0 |
| MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners | Jun 23, 2025 | Attribute, Audio inpainting | Unverified | 0 |
| Let Your Video Listen to Your Music! | Jun 23, 2025 | GPU, Music Generation | Unverified | 0 |
| Benchmarking Music Generation Models and Metrics via Human Preference Studies | Jun 23, 2025 | Benchmarking, Music Generation | Unverified | 0 |
| AI-Generated Song Detection via Lyrics Transcripts | Jun 23, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| From Generality to Mastery: Composer-Style Symbolic Music Generation via Large-Scale Pre-training | Jun 20, 2025 | Music Generation, Rhythm | Code Available | 0 |
| Double Entendre: Robust Audio-Based AI-Generated Lyrics Detection via Multi-View Fusion | Jun 19, 2025 | Music Generation | Code Available | 0 |
| Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models | Jun 18, 2025 | Music Generation, Text-to-Music Generation | Unverified | 0 |
| Personalizable Long-Context Symbolic Music Infilling with MIDI-RWKV | Jun 16, 2025 | Music Generation | Code Available | 1 |
| BNMusic: Blending Environmental Noises into Personalized Music | Jun 12, 2025 | Music Generation | Unverified | 0 |
| Fine-Grained control over Music Generation with Activation Steering | Jun 11, 2025 | Music Generation, Style Transfer | Unverified | 0 |
| Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation | Jun 10, 2025 | Audio inpainting, Music Generation | Unverified | 0 |
| SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement | Jun 9, 2025 | Music Generation | Code Available | 4 |
| LeVo: High-Quality Song Generation with Multi-Preference Alignment | Jun 9, 2025 | Instruction Following, Music Generation | Code Available | 5 |
| Improving AI-generated music with user-guided training | Jun 5, 2025 | Image Generation, Music Generation | Unverified | 0 |
| MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction | May 29, 2025 | Imputation, Music Generation | Unverified | 0 |
| Towards Video to Piano Music Generation with Chain-of-Perform Support Benchmarks | May 26, 2025 | Music Generation | Code Available | 1 |
| Moonbeam: A MIDI Foundation Model Using Both Absolute and Relative Music Attributes | May 21, 2025 | Music Classification, Music Generation | Code Available | 2 |
| Text2midi-InferAlign: Improving Symbolic Music Generation with Inference-Time Alignment | May 19, 2025 | Music Generation | Code Available | 1 |
| Unified Cross-modal Translation of Score Images, Symbolic Music, and Performance Audio | May 19, 2025 | Audio Generation, Information Retrieval | Unverified | 0 |
| DPN-GAN: Inducing Periodic Activations in Generative Adversarial Networks for High-Fidelity Audio Synthesis | May 14, 2025 | Audio Generation, Audio Synthesis | Unverified | 0 |
| GlobalMood: A cross-cultural benchmark for music emotion recognition | May 14, 2025 | Emotion Recognition, Music Emotion Recognition | Unverified | 0 |
| Not that Groove: Zero-Shot Symbolic Music Editing | May 13, 2025 | Music Generation | Unverified | 0 |
| Mamba-Diffusion Model with Learnable Wavelet for Controllable Symbolic Music Generation | May 6, 2025 | Image Generation, Mamba | Code Available | 1 |
| From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems | Apr 30, 2025 | Music Generation | Unverified | 0 |
| Extending Visual Dynamics for Video-to-Music Generation | Apr 10, 2025 | Music Generation, Optical Flow Estimation | Unverified | 0 |
| Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation | Apr 7, 2025 | All, Music Generation | Unverified | 0 |
| LoopGen: Training-Free Loopable Music Generation | Apr 6, 2025 | Music Generation | Code Available | 1 |
| Deep learning for music generation. Four approaches and their comparative evaluation | Apr 3, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives | Apr 1, 2025 | Music Generation | Unverified | 0 |
| Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model | Mar 28, 2025 | Music Generation | Unverified | 0 |
| Vision-to-Music Generation: A Survey | Mar 27, 2025 | multimodal generation, Music Generation | Code Available | 3 |
| Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation | Mar 25, 2025 | Music Generation | Unverified | 0 |
| Towards Responsible AI Music: an Investigation of Trustworthy Features for Creative Systems | Mar 24, 2025 | Ethics, Fairness | Unverified | 0 |
| AudioX: Diffusion Transformer for Anything-to-Audio Generation | Mar 13, 2025 | Audio Generation, Music Generation | Unverified | 0 |
| YuE: Scaling Open Foundation Models for Long-Form Music Generation | Mar 11, 2025 | Form, In-Context Learning | Code Available | 9 |
| FilmComposer: LLM-Driven Music Production for Silent Film Clips | Mar 11, 2025 | Music Generation, Rhythm | Unverified | 0 |
| A Multimodal Symphony: Integrating Taste and Sound through Generative AI | Mar 4, 2025 | Music Generation | Code Available | 0 |
| DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion | Mar 3, 2025 | Music Generation | Code Available | 7 |
| InspireMusic: Integrating Super Resolution and Large Language Model for High-Fidelity Long-Form Music Generation | Feb 28, 2025 | Audio Generation, Form | Code Available | 5 |
| NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms | Feb 25, 2025 | Language Modeling, Language Modelling | Code Available | 5 |
| A Comprehensive Survey on Generative AI for Video-to-Music Generation | Feb 18, 2025 | Music Generation | Unverified | 0 |
| Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models | Feb 17, 2025 | Music Generation | Unverified | 0 |
| F-StrIPE: Fast Structure-Informed Positional Encoding for Symbolic Music Generation | Feb 14, 2025 | Music Generation | Unverified | 0 |
| Video Soundtrack Generation by Aligning Emotions and Temporal Boundaries | Feb 14, 2025 | Music Generation | Unverified | 0 |
| TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument | Feb 13, 2025 | Audio Generation, Decoder | Code Available | 2 |
| YNote: A Novel Music Notation for Fine-Tuning LLMs in Music Generation | Feb 12, 2025 | Music Generation | Unverified | 0 |