Rhythm

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 515 papers

Title	Date	Tasks	Status	Hype	Score
OpenVoice: Versatile Instant Voice Cloning	Dec 3, 2023	RhythmVoice Cloning	CodeCode Available	7	5
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark	Jun 5, 2025	RhythmSpoken Language Understanding	CodeCode Available	7	5
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control	Sep 24, 2024	ClusteringLanguage Modelling	CodeCode Available	3	5
SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition	Feb 27, 2024	Instruction FollowingLanguage Modeling	CodeCode Available	3	5
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis	May 16, 2024	Language ModellingLarge Language Model	CodeCode Available	3	5
FlashSpeech: Efficient Zero-Shot Speech Synthesis	Apr 23, 2024	RhythmSpeech Synthesis	CodeCode Available	3	5
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling	Dec 31, 2023	3D Face AnimationDiversity	CodeCode Available	3	5
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play	May 5, 2025	AI AgentAutomatic Speech Recognition	CodeCode Available	3	5
Diff-BGM: A Diffusion Model for Video Background Music Generation	May 20, 2024	DiversityMusic Generation	CodeCode Available	2	5
An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple Domains	Oct 5, 2024	DiagnosticEvent Detection	CodeCode Available	2	5
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems	Jan 8, 2024	Language ModellingLarge Language Model	CodeCode Available	2	5
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation	Jul 21, 2024	DiversityMusic Generation	CodeCode Available	2	5
Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings	Oct 4, 2022	Gesture GenerationRhythm	CodeCode Available	2	5
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models	Mar 14, 2024	3D Face AnimationDiversity	CodeCode Available	2	5
AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion	Jun 1, 2024	Gesture GenerationRhythm	CodeCode Available	2	5
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation	Aug 5, 2024	RhythmSelf-Supervised Learning	CodeCode Available	2	5
Unsupervised Speech Decomposition via Triple Information Bottleneck	Apr 23, 2020	RhythmStyle Transfer	CodeCode Available	2	5
Development of Interpretable Machine Learning Models to Detect Arrhythmia based on ECG Data	May 5, 2022	BIG-bench Machine LearningFeature Importance	CodeCode Available	1	5
ECG Biometric Recognition: Review, System Proposal, and Benchmark Evaluation	Apr 8, 2022	Rhythm	CodeCode Available	1	5
Deep Music Analogy Via Latent Representation Disentanglement	Jun 9, 2019	DisentanglementRhythm	CodeCode Available	1	5
A holistic approach to polyphonic music transcription with neural networks	Oct 26, 2019	Beat TrackingMusic Transcription	CodeCode Available	1	5
Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XL	Apr 28, 2020	AllBenchmarking	CodeCode Available	1	5
Detecting beats in the photoplethysmogram: benchmarking open-source algorithms	Jul 19, 2022	BenchmarkingPhotoplethysmography (PPG) beat detection	CodeCode Available	1	5
DEEPCHORUS: A Hybrid Model of Multi-scale Convolution and Self-attention for Chorus Detection	Feb 13, 2022	Rhythm	CodeCode Available	1	5
ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease Diagnosis	Feb 16, 2025	DiagnosticRhythm	CodeCode Available	1	5

Show:10 25 50

← PrevPage 1 of 21Next →

No leaderboard results yet.