SOTAVerified

Rhythm

Papers

Showing 51100 of 515 papers

TitleStatusHype
SpeechSplit 2.0: Unsupervised speech disentanglement for voice conversion Without tuning autoencoder BottlenecksCode1
Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic MusicCode1
Arrhythmia Classifier Using Convolutional Neural Network with Adaptive Loss-aware Multi-bit Networks QuantizationCode1
GenéLive! Generating Rhythm Actions in Love Live!Code1
DEEPCHORUS: A Hybrid Model of Multi-scale Convolution and Self-attention for Chorus DetectionCode1
How Does it Sound?Code1
Generalizing electrocardiogram delineation -- Training convolutional neural networks with synthetic data augmentationCode1
M-Arg: Multimodal Argument Mining Dataset for Political Debates with Audio and TranscriptsCode1
BeatNet: CRNN and Particle Filtering for Online Joint Beat Downbeat and Meter TrackingCode1
An Empirical Evaluation of End-to-End Polyphonic Optical Music RecognitionCode1
Global Rhythm Style Transfer Without Text TranscriptionsCode1
RAD-TTS: Parallel Flow-Based TTS with Robust Alignment Learning and Diverse SynthesisCode1
LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical ParametersCode1
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head GenerationCode1
Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis TechniquesCode1
DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion TransformerCode1
Anomaly Detection in Time Series with Triadic Motif Fields and Application in Atrial Fibrillation ECG ClassificationCode1
Can GAN originate new electronic dance music genres? -- Generating novel rhythm patterns using GAN with Genre Ambiguity LossCode1
DanceIt: Music-inspired Dancing Video SynthesisCode1
Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker IdentityCode1
Music SketchNet: Controllable Music Generation via Factorized Representations of Pitch and RhythmCode1
Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature ModellingCode1
Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XLCode1
Towards democratizing music production with AI-Design of Variational Autoencoder-based Rhythm Generator as a DAW pluginCode1
Continuous Melody Generation via Disentangled Short-Term Representations and Structural ConditionsCode1
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokensCode1
A holistic approach to polyphonic music transcription with neural networksCode1
Deep Music Analogy Via Latent Representation DisentanglementCode1
Cardiologist-Level Arrhythmia Detection with Convolutional Neural NetworksCode1
Exploring Adapter Design Tradeoffs for Low Resource Music Generation0
CBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency Assessment0
Let Your Video Listen to Your Music!0
From Generality to Mastery: Composer-Style Symbolic Music Generation via Large-Scale Pre-trainingCode0
DanceChat: Large Language Model-Guided Music-to-Dance Generation0
Rhythm Features for Speaker Identification0
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric SpeechCode0
Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching0
Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and AssessmentCode0
Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism0
Source Tracing of Synthetic Speech Systems Through Paralinguistic Pre-Trained Representations0
PromptEVC: Controllable Emotional Voice Conversion with Natural Language Prompts0
MAVL: A Multilingual Audio-Video Lyrics Dataset for Animated Song TranslationCode0
EASY: Emotion-aware Speaker Anonymization via Factorized Distillation0
Early Diagnosis of Atrial Fibrillation Recurrence: A Large Tabular Model Approach with Structured and Unstructured Clinical Data0
IoT-Enabled Hemodynamic Surveillance System: AD8232 Bioelectric Signal Processing with ESP320
R-CAGE: A Structural Model for Emotion Output Design in Human-AI Interaction0
NeuroPal: A Clinically-Informed Multimodal LLM Assistant for Mental Health Combining Sleep Chronotherapy, Cognitive Behavioral Reframing, and Adaptive Phytochemical Intervention0
ArrhythmiaVision: Resource-Conscious Deep Learning Models with Visual Explanations for ECG Arrhythmia Classification0
Evolutionary Optimization for the Classification of Small Molecules Regulating the Circadian Rhythm Period: A Reliable AssessmentCode0
On the robustness of the emergent spatiotemporal dynamics in biophysically realistic and phenomenological whole-brain models at multiple network resolutions0
Show:102550
← PrevPage 2 of 11Next →

No leaderboard results yet.