SOTAVerified

Rhythm

Papers

Showing 110 of 515 papers

TitleStatusHype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning BenchmarkCode7
OpenVoice: Versatile Instant Voice CloningCode7
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture ModelingCode3
FlashSpeech: Efficient Zero-Shot Speech SynthesisCode3
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-PlayCode3
SongComposer: A Large Language Model for Lyric and Melody Generation in Song CompositionCode3
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style ControlCode3
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture SynthesisCode3
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space ModelsCode2
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music GenerationCode2
Show:102550
← PrevPage 1 of 52Next →

No leaderboard results yet.