| MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation | Jul 21, 2024 | DiversityMusic Generation | CodeCode Available | 2 |
| AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion | Jun 1, 2024 | Gesture GenerationRhythm | CodeCode Available | 2 |
| Diff-BGM: A Diffusion Model for Video Background Music Generation | May 20, 2024 | DiversityMusic Generation | CodeCode Available | 2 |
| MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | Mar 14, 2024 | 3D Face AnimationDiversity | CodeCode Available | 2 |
| SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems | Jan 8, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| Rhythmic Gesticulator: Rhythm-Aware Co-Speech Gesture Synthesis with Hierarchical Neural Embeddings | Oct 4, 2022 | Gesture GenerationRhythm | CodeCode Available | 2 |
| Unsupervised Speech Decomposition via Triple Information Bottleneck | Apr 23, 2020 | RhythmStyle Transfer | CodeCode Available | 2 |
| ProtoECGNet: Case-Based Interpretable Deep Learning for Multi-Label ECG Classification with Contrastive Learning | Apr 11, 2025 | Contrastive LearningDeep Learning | CodeCode Available | 1 |
| ECG-Expert-QA: A Benchmark for Evaluating Medical Large Language Models in Heart Disease Diagnosis | Feb 16, 2025 | DiagnosticRhythm | CodeCode Available | 1 |
| Reading Your Heart: Learning ECG Words and Sentences via Pre-training ECG Language Model | Feb 15, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |