SOTAVerified

Rhythm

Papers

Showing 1-25 of 515 papers

Title | Status | Hype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark | Code | 7
OpenVoice: Versatile Instant Voice Cloning | Code | 7
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play | Code | 3
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control | Code | 3
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis | Code | 3
FlashSpeech: Efficient Zero-Shot Speech Synthesis | Code | 3
SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition | Code | 3
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling | Code | 3
An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains | Code | 2
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation | Code | 2
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation | Code | 2
AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion | Code | 2
Diff-BGM: A Diffusion Model for Video Background Music Generation | Code | 2
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | Code | 2
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems | Code | 2
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings | Code | 2
Unsupervised Speech Decomposition via Triple Information Bottleneck | Code | 2
ProtoECGNet: Case-Based Interpretable Deep Learning for Multi-Label ECG Classification with Contrastive Learning | Code | 1
ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease Diagnosis | Code | 1
Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model | Code | 1
ImprovNet -- Generating Controllable Musical Improvisations with Iterative Corruption Refinement | Code | 1
A Multi-Resolution Mutual Learning Network for Multi-Label ECG Classification | Code | 1
Singing Voice Graph Modeling for SingFake Detection | Code | 1
Perception-Inspired Graph Convolution for Music Understanding Tasks | Code | 1
SDEMG: Score-based Diffusion Model for Surface Electromyographic Signal Denoising | Code | 1
Page 1 of 21

No leaderboard results yet.