| Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning | Aug 22, 2023 | Caption GenerationLarge Language Model | CodeCode Available | 2 | 5 |
| SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning | Jun 18, 2025 | Caption GenerationDescriptive | CodeCode Available | 2 | 5 |
| LP-MusicCaps: LLM-Based Pseudo Music Captioning | Jul 31, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response | Sep 15, 2023 | Caption GenerationLanguage Modelling | CodeCode Available | 1 | 5 |
| The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation | Nov 16, 2023 | Music CaptioningMusic Generation | CodeCode Available | 1 | 5 |
| MusCaps: Generating Captions for Music Audio | Apr 24, 2021 | Audio captioningClassification | CodeCode Available | 1 | 5 |
| Evaluation of pretrained language models on music understanding | Sep 17, 2024 | Music CaptioningNegation | CodeCode Available | 0 | 5 |
| Futga: Towards Fine-grained Music Understanding through Temporally-enhanced Generative Augmentation | Jul 29, 2024 | Music CaptioningMusic Generation | CodeCode Available | 0 | 5 |
| ALCAP: Alignment-Augmented Music Captioner | Dec 21, 2022 | Contrastive LearningMusic Captioning | CodeCode Available | 0 | 5 |
| Towards Music Captioning: Generating Music Playlist Descriptions | Aug 17, 2016 | Music Captioning | —Unverified | 0 | 0 |