| Fast Timing-Conditioned Latent Audio Diffusion | Feb 7, 2024 | Audio GenerationGPU | CodeCode Available | 7 |
| PAM: Prompting Audio-Language Models for Audio Quality Assessment | Feb 1, 2024 | Audio Quality AssessmentMusic Generation | CodeCode Available | 2 |
| The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation | Nov 16, 2023 | Music CaptioningMusic Generation | CodeCode Available | 1 |
| Mustango: Toward Controllable Text-to-Music Generation | Nov 14, 2023 | Data AugmentationDenoising | CodeCode Available | 2 |
| Music ControlNet: A model similar to SD ControlNetD that can accurately control music generation | Nov 7, 2023 | Music GenerationRhythm | CodeCode Available | 1 |
| Investigating Personalization Methods in Text to Music Generation | Sep 20, 2023 | Data AugmentationMusic Generation | CodeCode Available | 1 |
| Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning | Aug 22, 2023 | Caption GenerationLarge Language Model | CodeCode Available | 2 |
| AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining | Aug 10, 2023 | Audio GenerationIn-Context Learning | CodeCode Available | 4 |
| JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models | Aug 9, 2023 | Computational EfficiencyIn-Context Learning | CodeCode Available | 1 |
| MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies | Aug 3, 2023 | Audio GenerationBeat Tracking | CodeCode Available | 1 |