| TSRNet: Simple Framework for Real-time ECG Anomaly Detection with Multimodal Time and Spectrogram Restoration Network | Dec 15, 2023 | Anomaly Detection, Rhythm | Code Available | 1 |
| Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion | Dec 7, 2023 | Gesture Generation, Rhythm | Code Available | 1 |
| Jam-ALT: A Formatting-Aware Lyrics Transcription Benchmark | Nov 23, 2023 | Automatic Lyrics Transcription, Rhythm | Code Available | 1 |
| Music ControlNet: A model similar to SD ControlNet that can accurately control music generation | Nov 7, 2023 | Music Generation, Rhythm | Code Available | 1 |
| Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model | Nov 2, 2023 | Music Generation, Rhythm | Code Available | 1 |
| MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation | Sep 19, 2023 | Rhythm | Code Available | 1 |
| LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation | Sep 17, 2023 | Gesture Generation, Rhythm | Code Available | 1 |
| Multi-scale Cross-restoration Framework for Electrocardiogram Anomaly Detection | Aug 3, 2023 | Anomaly Detection, Diagnostic | Code Available | 1 |
| AesPA-Net: Aesthetic Pattern-Aware Style Transfer Networks | Jul 19, 2023 | Rhythm, Semantic correspondence | Code Available | 1 |
| Rhythm Modeling for Voice Conversion | Jul 12, 2023 | Rhythm, Voice Conversion | Code Available | 1 |
| Unsupervised Melody-to-Lyric Generation | May 30, 2023 | Disentanglement, Rhythm | Code Available | 1 |
| EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation | May 30, 2023 | Gesture Generation, Rhythm | Code Available | 1 |
| QPGesture: Quantization-Based and Phase-Guided Motion Matching for Natural Speech-Driven Gesture Generation | May 18, 2023 | Gesture Generation, Quantization | Code Available | 1 |
| scPrisma infers, filters and enhances topological signals in single-cell data using spectral template matching | Feb 27, 2023 | Rhythm, Template Matching | Code Available | 1 |
| Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units | Dec 19, 2022 | Rhythm, Voice Conversion | Code Available | 1 |
| Self-Supervised PPG Representation Learning Shows High Inter-Subject Variability | Dec 7, 2022 | Activity Recognition, Representation Learning | Code Available | 1 |
| A unified one-shot prosody and speaker conversion system with self-supervised discrete speech units | Nov 12, 2022 | Rhythm, Voice Conversion | Code Available | 1 |
| Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning | Sep 30, 2022 | ECG Classification, Knowledge Distillation | Code Available | 1 |
| The ReprGesture entry to the GENEA Challenge 2022 | Aug 25, 2022 | Decoder, Gesture Generation | Code Available | 1 |
| Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion | Aug 18, 2022 | Disentanglement, Rhythm | Code Available | 1 |
| Detecting beats in the photoplethysmogram: benchmarking open-source algorithms | Jul 19, 2022 | Benchmarking, Photoplethysmography (PPG) beat detection | Code Available | 1 |
| TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation | May 25, 2022 | Representation Learning, Rhythm | Code Available | 1 |
| Development of Interpretable Machine Learning Models to Detect Arrhythmia based on ECG Data | May 5, 2022 | BIG-bench Machine Learning, Feature Importance | Code Available | 1 |
| ECG Biometric Recognition: Review, System Proposal, and Benchmark Evaluation | Apr 8, 2022 | Rhythm | Code Available | 1 |
| IMLE-Net: An Interpretable Multi-level Multi-channel Model for ECG Classification | Apr 6, 2022 | ECG Classification, Rhythm | Code Available | 1 |