| SpeechSplit 2.0: Unsupervised speech disentanglement for voice conversion Without tuning autoencoder Bottlenecks | Mar 26, 2022 | Disentanglement, Rhythm | Code Available | 1 |
| Pseudo-Label Transfer from Frame-Level to Note-Level in a Teacher-Student Framework for Singing Transcription from Polyphonic Music | Mar 25, 2022 | Pseudo Label, Quantization | Code Available | 1 |
| Arrhythmia Classifier Using Convolutional Neural Network with Adaptive Loss-aware Multi-bit Networks Quantization | Feb 27, 2022 | Arrhythmia Detection, Quantization | Code Available | 1 |
| GenéLive! Generating Rhythm Actions in Love Live! | Feb 25, 2022 | Rhythm | Code Available | 1 |
| DEEPCHORUS: A Hybrid Model of Multi-scale Convolution and Self-attention for Chorus Detection | Feb 13, 2022 | Rhythm | Code Available | 1 |
| How Does it Sound? | Dec 1, 2021 | Rhythm | Code Available | 1 |
| Generalizing electrocardiogram delineation -- Training convolutional neural networks with synthetic data augmentation | Nov 25, 2021 | Data Augmentation, Rhythm | Code Available | 1 |
| M-Arg: Multimodal Argument Mining Dataset for Political Debates with Audio and Transcripts | Nov 1, 2021 | Argument Mining, Rhythm | Code Available | 1 |
| BeatNet: CRNN and Particle Filtering for Online Joint Beat Downbeat and Meter Tracking | Aug 8, 2021 | Downbeat Tracking, Online Beat Tracking | Code Available | 1 |
| An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition | Aug 3, 2021 | Binary Classification, Decoder | Code Available | 1 |
| Global Rhythm Style Transfer Without Text Transcriptions | Jun 16, 2021 | Representation Learning, Rhythm | Code Available | 1 |
| RAD-TTS: Parallel Flow-Based TTS with Robust Alignment Learning and Diverse Synthesis | Jun 2, 2021 | Diversity, Rhythm | Code Available | 1 |
| LoopNet: Musical Loop Synthesis Conditioned On Intuitive Musical Parameters | May 21, 2021 | Information Retrieval, Music Information Retrieval | Code Available | 1 |
| Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation | Apr 16, 2021 | Face Model, Rhythm | Code Available | 1 |
| Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques | Apr 2, 2021 | Decoder, Rhythm | Code Available | 1 |
| DanceFormer: Music Conditioned 3D Dance Generation with Parametric Motion Transformer | Mar 18, 2021 | Rhythm | Code Available | 1 |
| Anomaly Detection in Time Series with Triadic Motif Fields and Application in Atrial Fibrillation ECG Classification | Dec 9, 2020 | Anomaly Detection, Atrial Fibrillation Detection | Code Available | 1 |
| Can GAN originate new electronic dance music genres? -- Generating novel rhythm patterns using GAN with Genre Ambiguity Loss | Nov 25, 2020 | Deep Learning, Music Generation | Code Available | 1 |
| DanceIt: Music-inspired Dancing Video Synthesis | Sep 17, 2020 | cross-modal alignment, Rhythm | Code Available | 1 |
| Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity | Sep 4, 2020 | Gesture Generation, Rhythm | Code Available | 1 |
| Music SketchNet: Controllable Music Generation via Factorized Representations of Pitch and Rhythm | Aug 4, 2020 | Music Generation, Rhythm | Code Available | 1 |
| Music FaderNets: Controllable Music Generation Based On High-Level Features via Low-Level Feature Modelling | Jul 29, 2020 | Clustering, Disentanglement | Code Available | 1 |
| Deep Learning for ECG Analysis: Benchmarks and Insights from PTB-XL | Apr 28, 2020 | All, Benchmarking | Code Available | 1 |
| Towards democratizing music production with AI-Design of Variational Autoencoder-based Rhythm Generator as a DAW plugin | Apr 1, 2020 | Deep Learning, Music Generation | Code Available | 1 |
| Continuous Melody Generation via Disentangled Short-Term Representations and Structural Conditions | Feb 5, 2020 | Disentanglement, Music Generation | Code Available | 1 |
| Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens | Oct 26, 2019 | Rhythm, Style Transfer | Code Available | 1 |
| A holistic approach to polyphonic music transcription with neural networks | Oct 26, 2019 | Beat Tracking, Music Transcription | Code Available | 1 |
| Deep Music Analogy Via Latent Representation Disentanglement | Jun 9, 2019 | Disentanglement, Rhythm | Code Available | 1 |
| Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks | Jul 6, 2017 | Arrhythmia Detection, Electrocardiography (ECG) | Code Available | 1 |
| Exploring Adapter Design Tradeoffs for Low Resource Music Generation | Jun 26, 2025 | Music Generation, parameter-efficient fine-tuning | Unverified | 0 |
| CBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency Assessment | Jun 25, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Let Your Video Listen to Your Music! | Jun 23, 2025 | GPU, Music Generation | Unverified | 0 |
| From Generality to Mastery: Composer-Style Symbolic Music Generation via Large-Scale Pre-training | Jun 20, 2025 | Music Generation, Rhythm | Code Available | 0 |
| DanceChat: Large Language Model-Guided Music-to-Dance Generation | Jun 12, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Rhythm Features for Speaker Identification | Jun 7, 2025 | Deep Learning, Rhythm | Unverified | 0 |
| Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech | Jun 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching | Jun 1, 2025 | Rhythm, Style Transfer | Unverified | 0 |
| Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and Assessment | Jun 1, 2025 | Classification, regression | Code Available | 0 |
| Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism | Jun 1, 2025 | Rhythm | Unverified | 0 |
| Source Tracing of Synthetic Speech Systems Through Paralinguistic Pre-Trained Representations | Jun 1, 2025 | Emotion Recognition, Rhythm | Unverified | 0 |
| PromptEVC: Controllable Emotional Voice Conversion with Natural Language Prompts | May 27, 2025 | Diversity, Rhythm | Unverified | 0 |
| MAVL: A Multilingual Audio-Video Lyrics Dataset for Animated Song Translation | May 24, 2025 | Rhythm, Translation | Code Available | 0 |
| EASY: Emotion-aware Speaker Anonymization via Factorized Distillation | May 21, 2025 | Attribute, Disentanglement | Unverified | 0 |
| Early Diagnosis of Atrial Fibrillation Recurrence: A Large Tabular Model Approach with Structured and Unstructured Clinical Data | May 20, 2025 | Rhythm | Unverified | 0 |
| IoT-Enabled Hemodynamic Surveillance System: AD8232 Bioelectric Signal Processing with ESP32 | May 14, 2025 | Rhythm | Unverified | 0 |
| R-CAGE: A Structural Model for Emotion Output Design in Human-AI Interaction | May 11, 2025 | Rhythm | Unverified | 0 |
| NeuroPal: A Clinically-Informed Multimodal LLM Assistant for Mental Health Combining Sleep Chronotherapy, Cognitive Behavioral Reframing, and Adaptive Phytochemical Intervention | May 10, 2025 | Large Language Model, Rhythm | Unverified | 0 |
| ArrhythmiaVision: Resource-Conscious Deep Learning Models with Visual Explanations for ECG Arrhythmia Classification | Apr 30, 2025 | Computational Efficiency, Electrocardiography (ECG) | Unverified | 0 |
| Evolutionary Optimization for the Classification of Small Molecules Regulating the Circadian Rhythm Period: A Reliable Assessment | Apr 27, 2025 | Classification, feature selection | Code Available | 0 |
| On the robustness of the emergent spatiotemporal dynamics in biophysically realistic and phenomenological whole-brain models at multiple network resolutions | Apr 24, 2025 | Rhythm | Unverified | 0 |