SOTAVerified

Rhythm

Papers

Showing 125 of 515 papers

TitleStatusHype
OpenVoice: Versatile Instant Voice CloningCode7
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning BenchmarkCode7
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style ControlCode3
SongComposer: A Large Language Model for Lyric and Melody Generation in Song CompositionCode3
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture SynthesisCode3
FlashSpeech: Efficient Zero-Shot Speech SynthesisCode3
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture ModelingCode3
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-PlayCode3
Diff-BGM: A Diffusion Model for Video Background Music GenerationCode2
An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple DomainsCode2
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent SystemsCode2
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music GenerationCode2
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural EmbeddingsCode2
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space ModelsCode2
AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent DiffusionCode2
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility EstimationCode2
Unsupervised Speech Decomposition via Triple Information BottleneckCode2
Development of Interpretable Machine Learning Models to Detect Arrhythmia based on ECG DataCode1
ECG Biometric Recognition: Review, System Proposal, and Benchmark EvaluationCode1
Deep Music Analogy Via Latent Representation DisentanglementCode1
A holistic approach to polyphonic music transcription with neural networksCode1
Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XLCode1
Detecting beats in the photoplethysmogram: benchmarking open-source algorithmsCode1
DEEPCHORUS: A Hybrid Model of Multi-scale Convolution and Self-attention for Chorus DetectionCode1
ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease DiagnosisCode1
Show:102550
← PrevPage 1 of 21Next →

No leaderboard results yet.