SOTAVerified

Rhythm

Papers

Showing 150 of 515 papers

TitleStatusHype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning BenchmarkCode7
OpenVoice: Versatile Instant Voice CloningCode7
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-PlayCode3
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style ControlCode3
SongComposer: A Large Language Model for Lyric and Melody Generation in Song CompositionCode3
FlashSpeech: Efficient Zero-Shot Speech SynthesisCode3
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture ModelingCode3
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture SynthesisCode3
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space ModelsCode2
Diff-BGM: A Diffusion Model for Video Background Music GenerationCode2
AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent DiffusionCode2
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music GenerationCode2
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility EstimationCode2
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural EmbeddingsCode2
An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple DomainsCode2
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent SystemsCode2
Unsupervised Speech Decomposition via Triple Information BottleneckCode2
Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature ModellingCode1
MelodyGLM: Multi-task Pre-training for Symbolic Melody GenerationCode1
Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised LearningCode1
M-Arg: Multimodal Argument Mining Dataset for Political Debates with Audio and TranscriptsCode1
Jam-ALT: A Formatting-Aware Lyrics Transcription BenchmarkCode1
IMLE-Net: An Interpretable Multi-level Multi-channel Model for ECG ClassificationCode1
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture GenerationCode1
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical ParametersCode1
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokensCode1
Music SketchNet: Controllable Music Generation via Factorized Representations of Pitch and RhythmCode1
EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture GenerationCode1
Multi-scale Cross-restoration Framework for Electrocardiogram Anomaly DetectionCode1
Music ControlNet: A model similar to SD ControlNetD that can accurately control music generationCode1
Emotional Speech-driven 3D Body Animation via Disentangled Latent DiffusionCode1
GenéLive! Generating Rhythm Actions in Love Live!Code1
ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease DiagnosisCode1
AesPA-Net: Aesthetic Pattern-Aware Style Transfer NetworksCode1
Development of Interpretable Machine Learning Models to Detect Arrhythmia based on ECG DataCode1
Generalizing electrocardiogram delineation -- Training convolutional neural networks with synthetic data augmentationCode1
DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion TransformerCode1
DanceIt: Music-inspired Dancing Video SynthesisCode1
DEEPCHORUS: A Hybrid Model of Multi-scale Convolution and Self-attention for Chorus DetectionCode1
Detecting beats in the photoplethysmogram: benchmarking open-source algorithmsCode1
Cardiologist-Level Arrhythmia Detection with Convolutional Neural NetworksCode1
ECG Biometric Recognition: Review, System Proposal, and Benchmark EvaluationCode1
An Empirical Evaluation of End-to-End Polyphonic Optical Music RecognitionCode1
Anomaly Detection in Time Series with Triadic Motif Fields and Application in Atrial Fibrillation ECG ClassificationCode1
Continuous Melody Generation via Disentangled Short-Term Representations and Structural ConditionsCode1
Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XLCode1
A holistic approach to polyphonic music transcription with neural networksCode1
How Does it Sound?Code1
ImprovNet -- Generating Controllable Musical Improvisations with Iterative Corruption RefinementCode1
A Multi-Resolution Mutual Learning Network for Multi-Label ECG ClassificationCode1
Show:102550
← PrevPage 1 of 11Next →

No leaderboard results yet.