| CBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency Assessment | Jun 25, 2025 | Action DetectionActivity Detection | —Unverified | 0 |
| Let Your Video Listen to Your Music! | Jun 23, 2025 | GPUMusic Generation | —Unverified | 0 |
| From Generality to Mastery: Composer-Style Symbolic Music Generation via Large-Scale Pre-training | Jun 20, 2025 | Music GenerationRhythm | CodeCode Available | 0 |
| DanceChat: Large Language Model-Guided Music-to-Dance Generation | Jun 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Rhythm Features for Speaker Identification | Jun 7, 2025 | Deep LearningRhythm | —Unverified | 0 |
| Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech | Jun 2, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and Assessment | Jun 1, 2025 | Classificationregression | CodeCode Available | 0 |
| Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching | Jun 1, 2025 | RhythmStyle Transfer | —Unverified | 0 |
| Source Tracing of Synthetic Speech Systems Through Paralinguistic Pre-Trained Representations | Jun 1, 2025 | Emotion RecognitionRhythm | —Unverified | 0 |
| Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism | Jun 1, 2025 | Rhythm | —Unverified | 0 |