SOTAVerified

Text-to-Music Generation

Papers

Showing 21–30 of 37 papers

| Title | Status | Hype |
|---|---|---|
| Fast Timing-Conditioned Latent Audio Diffusion | Code | 7 |
| PAM: Prompting Audio-Language Models for Audio Quality Assessment | Code | 2 |
| The Song Describer Dataset: a Corpus of Audio Captions for Music-and-Language Evaluation | Code | 1 |
| Mustango: Toward Controllable Text-to-Music Generation | Code | 2 |
| Music ControlNet: A model similar to SD ControlNetD that can accurately control music generation | Code | 1 |
| Investigating Personalization Methods in Text to Music Generation | Code | 1 |
| Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning | Code | 2 |
| AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining | Code | 4 |
| JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models | Code | 1 |
| MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies | Code | 1 |

Benchmark Results

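| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AudioLDM2-music | FD_openl3 | 354.05 | | Unverified |
| 2 | Stable Audio | FD_openl3 | 108.69 | | Unverified |
| 3 | Riffusion | FAD | 13.4 | | Unverified |
| 4 | Mubert | FAD | 9.6 | | Unverified |
| 5 | MeLoDy | FAD | 5.41 | | Unverified |
| 6 | MusicGen w/ random melody (1.5B) | FAD | 5 | | Unverified |
| 7 | MusicLM | FAD | 4 | | Unverified |
| 8 | Noise2Music spectrogram | FAD | 3.84 | | Unverified |
| 9 | MusicGen w/o melody (3.3B) | FAD | 3.8 | | Unverified |
| 10 | UniAudio | FAD | 3.65 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Mustango (non-pretrained) | FAD | 2.09 | | Unverified |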