SOTAVerified

Text-to-Music Generation

Papers

Showing 110 of 37 papers

TitleStatusHype
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners0
Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models0
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation0
TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-InstrumentCode2
Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning0
ETTA: Elucidating the Design Space of Text-to-Audio ModelsCode2
Long-Form Text-to-Music Generation with Adaptive Prompts: A Case Study in Tabletop Role-Playing Games SoundtracksCode0
MusicFlow: Cascaded Flow Matching for Text Guided Music Generation0
Editing Music with Melody and Text: Using ControlNet for Diffusion Transformer0
Melody-Guided Music GenerationCode2
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1AudioLDM2-musicFD_openl3354.05Unverified
2Stable AudioFD_openl3108.69Unverified
3RiffusionFAD13.4Unverified
4MubertFAD9.6Unverified
5MeLoDyFAD5.41Unverified
6MusicGen w/ random melody (1.5B)FAD5Unverified
7MusicLMFAD4Unverified
8Noise2Music spectrogramFAD3.84Unverified
9MusicGen w/o melody (3.3B)FAD3.8Unverified
10UniAudioFAD3.65Unverified
#ModelMetricClaimedVerifiedStatus
1Mustango (non-pretrained)FAD2.09Unverified