| Enhancing Music Genre Classification through Multi-Algorithm Analysis and User-Friendly Visualization | May 27, 2024 | Genre classificationMusic Genre Classification | —Unverified | 0 |
| Diff-BGM: A Diffusion Model for Video Background Music Generation | May 20, 2024 | DiversityMusic Generation | CodeCode Available | 2 |
| A novel Reservoir Architecture for Periodic Time Series Prediction | May 16, 2024 | RhythmTime Series | —Unverified | 0 |
| Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis | May 16, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 |
| Dance Any Beat: Blending Beats with Visuals in Dance Video Generation | May 15, 2024 | Image to Video GenerationOptical Flow Estimation | —Unverified | 0 |
| Perception-Inspired Graph Convolution for Music Understanding Tasks | May 15, 2024 | Graph ClassificationGraph Learning | CodeCode Available | 1 |
| FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation | May 13, 2024 | Rhythm | —Unverified | 0 |
| Exploring Speech Pattern Disorders in Autism using Machine Learning | May 3, 2024 | Diagnosticregression | —Unverified | 0 |
| Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model | Apr 30, 2024 | DescriptiveGesture Generation | —Unverified | 0 |
| FlashSpeech: Efficient Zero-Shot Speech Synthesis | Apr 23, 2024 | RhythmSpeech Synthesis | CodeCode Available | 3 |