| From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems | Apr 30, 2025 | Music Generation | —Unverified | 0 |
| Extending Visual Dynamics for Video-to-Music Generation | Apr 10, 2025 | Music Generation, Optical Flow Estimation | —Unverified | 0 |
| Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation | Apr 7, 2025 | All, Music Generation | —Unverified | 0 |
| Deep learning for music generation. Four approaches and their comparative evaluation | Apr 3, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| A Survey on Music Generation from Single-Modal, Cross-Modal, and Multi-Modal Perspectives | Apr 1, 2025 | Music Generation | —Unverified | 0 |
| Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model | Mar 28, 2025 | Music Generation | —Unverified | 0 |
| Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation | Mar 25, 2025 | Music Generation | —Unverified | 0 |
| Towards Responsible AI Music: an Investigation of Trustworthy Features for Creative Systems | Mar 24, 2025 | Ethics, Fairness | —Unverified | 0 |
| AudioX: Diffusion Transformer for Anything-to-Audio Generation | Mar 13, 2025 | Audio Generation, Music Generation | —Unverified | 0 |
| FilmComposer: LLM-Driven Music Production for Silent Film Clips | Mar 11, 2025 | Music Generation, Rhythm | —Unverified | 0 |
| A Multimodal Symphony: Integrating Taste and Sound through Generative AI | Mar 4, 2025 | Music Generation | Code Available | 0 |
| A Comprehensive Survey on Generative AI for Video-to-Music Generation | Feb 18, 2025 | Music Generation | —Unverified | 0 |
| Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models | Feb 17, 2025 | Music Generation | —Unverified | 0 |
| Video Soundtrack Generation by Aligning Emotions and Temporal Boundaries | Feb 14, 2025 | Music Generation | —Unverified | 0 |
| F-StrIPE: Fast Structure-Informed Positional Encoding for Symbolic Music Generation | Feb 14, 2025 | Music Generation | —Unverified | 0 |
| YNote: A Novel Music Notation for Fine-Tuning LLMs in Music Generation | Feb 12, 2025 | Music Generation | —Unverified | 0 |
| Music for All: Representational Bias and Cross-Cultural Adaptability of Music Generation Models | Feb 11, 2025 | All, Music Generation | Code Available | 0 |
| Music Generation using Human-In-The-Loop Reinforcement Learning | Jan 25, 2025 | Music Generation, Q-Learning | —Unverified | 0 |
| Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jan 24, 2025 | FAD, Language Modeling | —Unverified | 0 |
| GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions | Jan 17, 2025 | Diversity, Music Generation | —Unverified | 0 |
| XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework | Jan 15, 2025 | Emotion Recognition, Image Generation | —Unverified | 0 |
| Unrolled Creative Adversarial Network For Generating Novel Musical Pieces | Dec 31, 2024 | Music Generation | Code Available | 0 |
| Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants | Dec 25, 2024 | Music Generation | Code Available | 0 |
| CSL-L2M: Controllable Song-Level Lyric-to-Melody Generation Based on Conditional Transformer with Fine-Grained Lyric and Musical Controls | Dec 13, 2024 | Decoder, Music Generation | —Unverified | 0 |
| Interpreting Graphic Notation with MusicLDM: An AI Improvisation of Cornelius Cardew's Treatise | Dec 12, 2024 | Descriptive, Music Generation | —Unverified | 0 |
| Watermarking Training Data of Music Generation Models | Dec 11, 2024 | Music Generation | —Unverified | 0 |
| Missing Melodies: AI Music Generation and its "Nearly" Complete Omission of the Global South | Dec 5, 2024 | Music Generation | —Unverified | 0 |
| MusicGen-Chord: Advancing Music Generation through Chord Progressions and Interactive Web-UI | Nov 30, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| A Training-Free Approach for Music Style Transfer with Latent Diffusion Models | Nov 24, 2024 | Music Generation, Music Style Transfer | —Unverified | 0 |
| Generative AI for Music and Audio | Nov 21, 2024 | Music Generation | —Unverified | 0 |
| PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation | Nov 13, 2024 | Audio Generation, Diversity | —Unverified | 0 |
| Long-Form Text-to-Music Generation with Adaptive Prompts: A Case Study in Tabletop Role-Playing Games Soundtracks | Nov 6, 2024 | Form, Music Generation | Code Available | 0 |
| Emotion-Guided Image to Music Generation | Oct 29, 2024 | Contrastive Learning, Music Generation | —Unverified | 0 |
| MusicFlow: Cascaded Flow Matching for Text Guided Music Generation | Oct 27, 2024 | Music Generation, Text-to-Music Generation | —Unverified | 0 |
| Music102: An D_12-equivariant transformer for chord progression accompaniment | Oct 23, 2024 | Music Generation | Code Available | 0 |
| Exploring Tokenization Methods for Multitrack Sheet Music Generation | Oct 23, 2024 | Computational Efficiency, Music Generation | —Unverified | 0 |
| MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization | Oct 16, 2024 | In-Context Learning, Music Generation | —Unverified | 0 |
| Do we need more complex representations for structure? A comparison of note duration representation for Music Transformers | Oct 14, 2024 | Music Generation | —Unverified | 0 |
| M2M-Gen: A Multimodal Framework for Automated Background Music Generation in Japanese Manga Using Large Language Models | Oct 13, 2024 | Emotion Classification, Music Generation | —Unverified | 0 |
| Efficient Fine-Grained Guidance for Diffusion Model Based Symbolic Music Generation | Oct 11, 2024 | Music Generation | —Unverified | 0 |
| Diversity-Rewarded CFG Distillation | Oct 8, 2024 | Diversity, Music Generation | —Unverified | 0 |
| Presto! Distilling Steps and Layers for Accelerating Music Generation | Oct 7, 2024 | Diversity, Music Generation | —Unverified | 0 |
| Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer | Oct 7, 2024 | Music Generation, Music Style Transfer | —Unverified | 0 |
| Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces | Oct 1, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing | Sep 17, 2024 | Music Generation, TAG | —Unverified | 0 |
| FakeMusicCaps: a Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models | Sep 16, 2024 | Music Generation | Code Available | 0 |
| Hierarchical Symbolic Pop Music Generation with Graph Neural Networks | Sep 12, 2024 | Music Generation, Rhythm | —Unverified | 0 |
| Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning | Sep 12, 2024 | Code Generation, Music Generation | —Unverified | 0 |
| Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings | Sep 12, 2024 | FAD, Image Captioning | —Unverified | 0 |
| VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos | Sep 11, 2024 | Contrastive Learning, Music Generation | —Unverified | 0 |