| Visual-Aware Text-to-Speech | Jun 21, 2023 | RhythmSpeech Synthesis | —Unverified | 0 |
| Automatic Speech Disentanglement for Voice Conversion using Rank Module and Speech Augmentation | Jun 21, 2023 | DisentanglementRhythm | —Unverified | 0 |
| MSW-Transformer: Multi-Scale Shifted Windows Transformer Networks for 12-Lead ECG Classification | Jun 21, 2023 | ClassificationDiagnostic | —Unverified | 0 |
| PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation | Jun 14, 2023 | DenoisingRhythm | CodeCode Available | 0 |
| Dance Generation by Sound Symbolic Words | Jun 6, 2023 | DiversityRhythm | —Unverified | 0 |
| BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion | Jun 5, 2023 | Rhythm | —Unverified | 0 |
| Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis | Jun 5, 2023 | RhythmSentence | —Unverified | 0 |
| ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer | May 22, 2023 | DecoderDenoising | —Unverified | 0 |
| ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios | May 20, 2023 | Rhythmtext-to-speech | —Unverified | 0 |
| Deep Learning-based Prediction of Electrical Arrhythmia Circuits from Cardiac Motion: An In-Silico Study | May 13, 2023 | DiagnosticRhythm | —Unverified | 0 |