| The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation | Nov 16, 2023 | Music CaptioningMusic Generation | CodeCode Available | 1 |
| Mustango: Toward Controllable Text-to-Music Generation | Nov 14, 2023 | Data AugmentationDenoising | CodeCode Available | 2 |
| Exploring Variational Auto-Encoder Architectures, Configurations, and Datasets for Generative Music Explainable AI | Nov 14, 2023 | AttributeMusic Generation | CodeCode Available | 1 |
| Music ControlNet: A model similar to SD ControlNetD that can accurately control music generation | Nov 7, 2023 | Music GenerationRhythm | CodeCode Available | 1 |
| Are Words Enough? On the semantic conditioning of affective music generation | Nov 7, 2023 | Deep LearningMusic Generation | —Unverified | 0 |
| Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model | Nov 2, 2023 | Music GenerationRhythm | CodeCode Available | 1 |
| JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation | Oct 29, 2023 | Music Generation | CodeCode Available | 1 |
| miditok: A Python package for MIDI file tokenization | Oct 26, 2023 | Music GenerationMusic Information Retrieval | —Unverified | 0 |
| Content-based Controls For Music Large Language Modeling | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Structured Multi-Track Accompaniment Arrangement via Style Prior Modelling | Oct 25, 2023 | Computational EfficiencyDisentanglement | CodeCode Available | 1 |