SOTAVerified

Text-to-Music Generation

Papers

Showing 1120 of 37 papers

TitleStatusHype
FLUX that Plays MusicCode13
Combining audio control and style transfer using latent diffusion0
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music GenerationCode2
Stable Audio OpenCode7
The Interpretation Gap in Text-to-Music Generation Models0
Improving Text-To-Audio Models with Synthetic CaptionsCode5
JEN-1 DreamStyler: Customized Musical Concept Learning via Pivotal Parameters Tuning0
MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion ModelsCode2
Quality-aware Masked Diffusion Transformer for Enhanced Music GenerationCode4
MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion ModelsCode1
Show:102550
← PrevPage 2 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1AudioLDM2-musicFD_openl3354.05Unverified
2Stable AudioFD_openl3108.69Unverified
3RiffusionFAD13.4Unverified
4MubertFAD9.6Unverified
5MeLoDyFAD5.41Unverified
6MusicGen w/ random melody (1.5B)FAD5Unverified
7MusicLMFAD4Unverified
8Noise2Music spectrogramFAD3.84Unverified
9MusicGen w/o melody (3.3B)FAD3.8Unverified
10UniAudioFAD3.65Unverified
#ModelMetricClaimedVerifiedStatus
1Mustango (non-pretrained)FAD2.09Unverified