| Music for All: Representational Bias and Cross-Cultural Adaptability of Music Generation Models | Feb 11, 2025 | All, Music Generation | Code Available | 0 |
| ImprovNet -- Generating Controllable Musical Improvisations with Iterative Corruption Refinement | Feb 6, 2025 | Music Generation, Rhythm | Code Available | 1 |
| Music Generation using Human-In-The-Loop Reinforcement Learning | Jan 25, 2025 | Music Generation, Q-Learning | —Unverified | 0 |
| Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jan 24, 2025 | FAD, Language Modeling | —Unverified | 0 |
| GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions | Jan 17, 2025 | Diversity, Music Generation | —Unverified | 0 |
| XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework | Jan 15, 2025 | Emotion Recognition, Image Generation | —Unverified | 0 |
| Unrolled Creative Adversarial Network For Generating Novel Musical Pieces | Dec 31, 2024 | Music Generation | Code Available | 0 |
| ETTA: Elucidating the Design Space of Text-to-Audio Models | Dec 26, 2024 | AudioCaps, Audio captioning | Code Available | 2 |
| Zema Dataset: A Comprehensive Study of Yaredawi Zema with a Focus on Horologium Chants | Dec 25, 2024 | Music Generation | Code Available | 0 |
| CSL-L2M: Controllable Song-Level Lyric-to-Melody Generation Based on Conditional Transformer with Fine-Grained Lyric and Musical Controls | Dec 13, 2024 | Decoder, Music Generation | —Unverified | 0 |
| Interpreting Graphic Notation with MusicLDM: An AI Improvisation of Cornelius Cardew's Treatise | Dec 12, 2024 | Descriptive, Music Generation | —Unverified | 0 |
| Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation | Dec 12, 2024 | cross-modal alignment, Multimodal Music Generation | Code Available | 1 |
| Predicting Human Brain States with Transformer | Dec 11, 2024 | Language Modelling, Music Generation | Code Available | 2 |
| Watermarking Training Data of Music Generation Models | Dec 11, 2024 | Music Generation | —Unverified | 0 |
| Frechet Music Distance: A Metric For Generative Symbolic Music Evaluation | Dec 10, 2024 | FAD, Music Generation | Code Available | 1 |
| Missing Melodies: AI Music Generation and its "Nearly" Complete Omission of the Global South | Dec 5, 2024 | Music Generation | —Unverified | 0 |
| MusicGen-Chord: Advancing Music Generation through Chord Progressions and Interactive Web-UI | Nov 30, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| A Training-Free Approach for Music Style Transfer with Latent Diffusion Models | Nov 24, 2024 | Music Generation, Music Style Transfer | —Unverified | 0 |
| Generative AI for Music and Audio | Nov 21, 2024 | Music Generation | —Unverified | 0 |
| PerceiverS: A Multi-Scale Perceiver with Effective Segmentation for Long-Term Expressive Symbolic Music Generation | Nov 13, 2024 | Audio Generation, Diversity | —Unverified | 0 |
| Long-Form Text-to-Music Generation with Adaptive Prompts: A Case Study in Tabletop Role-Playing Games Soundtracks | Nov 6, 2024 | Form, Music Generation | Code Available | 0 |
| Emotion-Guided Image to Music Generation | Oct 29, 2024 | Contrastive Learning, Music Generation | —Unverified | 0 |
| MusicFlow: Cascaded Flow Matching for Text Guided Music Generation | Oct 27, 2024 | Music Generation, Text-to-Music Generation | —Unverified | 0 |
| Symbotunes: unified hub for symbolic music generative models | Oct 27, 2024 | Music Generation | Code Available | 1 |
| Music102: An D_12-equivariant transformer for chord progression accompaniment | Oct 23, 2024 | Music Generation | Code Available | 0 |
| Exploring Tokenization Methods for Multitrack Sheet Music Generation | Oct 23, 2024 | Computational Efficiency, Music Generation | —Unverified | 0 |
| MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization | Oct 16, 2024 | In-Context Learning, Music Generation | —Unverified | 0 |
| Do we need more complex representations for structure? A comparison of note duration representation for Music Transformers | Oct 14, 2024 | Music Generation | —Unverified | 0 |
| M2M-Gen: A Multimodal Framework for Automated Background Music Generation in Japanese Manga Using Large Language Models | Oct 13, 2024 | Emotion Classification, Music Generation | —Unverified | 0 |
| Efficient Fine-Grained Guidance for Diffusion Model Based Symbolic Music Generation | Oct 11, 2024 | Music Generation | —Unverified | 0 |
| Diversity-Rewarded CFG Distillation | Oct 8, 2024 | Diversity, Music Generation | —Unverified | 0 |
| Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer | Oct 7, 2024 | Music Generation, Music Style Transfer | —Unverified | 0 |
| Presto! Distilling Steps and Layers for Accelerating Music Generation | Oct 7, 2024 | Diversity, Music Generation | —Unverified | 0 |
| Do Music Generation Models Encode Music Theory? | Oct 1, 2024 | Emotion Recognition, Genre classification | Code Available | 1 |
| Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces | Oct 1, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| Melody-Guided Music Generation | Sep 30, 2024 | cross-modal alignment, Music Generation | Code Available | 2 |
| Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance | Sep 23, 2024 | Emotion Recognition, FAD | Code Available | 3 |
| PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing | Sep 17, 2024 | Music Generation, TAG | —Unverified | 0 |
| FakeMusicCaps: a Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models | Sep 16, 2024 | Music Generation | Code Available | 0 |
| Hierarchical Symbolic Pop Music Generation with Graph Neural Networks | Sep 12, 2024 | Music Generation, Rhythm | —Unverified | 0 |
| Tidal MerzA: Combining affective modelling and autonomous code generation through Reinforcement Learning | Sep 12, 2024 | Code Generation, Music Generation | —Unverified | 0 |
| Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings | Sep 12, 2024 | FAD, Image Captioning | —Unverified | 0 |
| VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos | Sep 11, 2024 | Contrastive Learning, Music Generation | —Unverified | 0 |
| Multi-Source Music Generation with Latent Diffusion | Sep 10, 2024 | FAD, Music Generation | Code Available | 1 |
| SongCreator: Lyrics-based Universal Song Generation | Sep 9, 2024 | Language Modelling, Music Generation | —Unverified | 0 |
| Applications and Advances of Artificial Intelligence in Music Generation:A Review | Sep 3, 2024 | Audio Generation, Music Generation | —Unverified | 0 |
| MMT-BERT: Chord-aware Symbolic Music Generation Based on Multitrack Music Transformer and MusicBERT | Sep 2, 2024 | Generative Adversarial Network, Music Generation | —Unverified | 0 |
| FLUX that Plays Music | Sep 1, 2024 | Music Generation, Text-to-Music Generation | Code Available | 13 |
| Unifying Multitrack Music Arrangement via Reconstruction Fine-Tuning and Efficient Tokenization | Aug 27, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| Controlling Surprisal in Music Generation via Information Content Curve Matching | Aug 12, 2024 | Music Generation | Code Available | 0 |