SOTAVerified

Rhythm

Papers

Showing 150 of 515 papers

TitleStatusHype
OpenVoice: Versatile Instant Voice CloningCode7
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning BenchmarkCode7
FlashSpeech: Efficient Zero-Shot Speech SynthesisCode3
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture ModelingCode3
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style ControlCode3
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture SynthesisCode3
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-PlayCode3
SongComposer: A Large Language Model for Lyric and Melody Generation in Song CompositionCode3
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music GenerationCode2
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space ModelsCode2
AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent DiffusionCode2
An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple DomainsCode2
Diff-BGM: A Diffusion Model for Video Background Music GenerationCode2
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent SystemsCode2
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility EstimationCode2
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural EmbeddingsCode2
Unsupervised Speech Decomposition via Triple Information BottleneckCode2
Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature ModellingCode1
Emotional Speech-driven 3D Body Animation via Disentangled Latent DiffusionCode1
Multi-scale Cross-restoration Framework for Electrocardiogram Anomaly DetectionCode1
MelodyGLM: Multi-task Pre-training for Symbolic Melody GenerationCode1
EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture GenerationCode1
A holistic approach to polyphonic music transcription with neural networksCode1
ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease DiagnosisCode1
ECG Biometric Recognition: Review, System Proposal, and Benchmark EvaluationCode1
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokensCode1
M-Arg: Multimodal Argument Mining Dataset for Political Debates with Audio and TranscriptsCode1
Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised LearningCode1
Music ControlNet: A model similar to SD ControlNetD that can accurately control music generationCode1
Music SketchNet: Controllable Music Generation via Factorized Representations of Pitch and RhythmCode1
ImprovNet -- Generating Controllable Musical Improvisations with Iterative Corruption RefinementCode1
DEEPCHORUS: A Hybrid Model of Multi-scale Convolution and Self-attention for Chorus DetectionCode1
Jam-ALT: A Formatting-Aware Lyrics Transcription BenchmarkCode1
How Does it Sound?Code1
Detecting beats in the photoplethysmogram: benchmarking open-source algorithmsCode1
IMLE-Net: An Interpretable Multi-level Multi-channel Model for ECG ClassificationCode1
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture GenerationCode1
Cardiologist-Level Arrhythmia Detection with Convolutional Neural NetworksCode1
AesPA-Net: Aesthetic Pattern-Aware Style Transfer NetworksCode1
Continuous Melody Generation via Disentangled Short-Term Representations and Structural ConditionsCode1
DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion TransformerCode1
DanceIt: Music-inspired Dancing Video SynthesisCode1
An Empirical Evaluation of End-to-End Polyphonic Optical Music RecognitionCode1
Anomaly Detection in Time Series with Triadic Motif Fields and Application in Atrial Fibrillation ECG ClassificationCode1
Deep Music Analogy Via Latent Representation DisentanglementCode1
Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XLCode1
GenéLive! Generating Rhythm Actions in Love Live!Code1
Can GAN originate new electronic dance music genres? -- Generating novel rhythm patterns using GAN with Genre Ambiguity LossCode1
Generalizing electrocardiogram delineation -- Training convolutional neural networks with synthetic data augmentationCode1
A Multi-Resolution Mutual Learning Network for Multi-Label ECG ClassificationCode1
Show:102550
← PrevPage 1 of 11Next →

No leaderboard results yet.