| Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis | Aug 31, 2023 | Expressive Speech Synthesis, Sentence | —Unverified | 0 |
| Cross-lingual Prosody Transfer for Expressive Machine Dubbing | Jun 20, 2023 | Expressive Speech Synthesis, Speech Synthesis | —Unverified | 0 |
| Ensemble prosody prediction for expressive speech synthesis | Apr 3, 2023 | Diversity, Ensemble Learning | —Unverified | 0 |
| On granularity of prosodic representations in expressive text-to-speech | Jan 26, 2023 | Expressive Speech Synthesis, Speech Synthesis | —Unverified | 0 |
| Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling | Nov 19, 2022 | Expressive Speech Synthesis, Speech Synthesis | —Unverified | 0 |
| Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis | Nov 2, 2022 | Expressive Speech Synthesis, Speech Synthesis | —Unverified | 0 |
| Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis | Nov 1, 2022 | Disentanglement, Diversity | —Unverified | 0 |
| Self-supervised Context-aware Style Representation for Expressive Speech Synthesis | Jun 25, 2022 | Contrastive Learning, Deep Clustering | —Unverified | 0 |
| Fine-grained Noise Control for Multispeaker Speech Synthesis | Apr 11, 2022 | Expressive Speech Synthesis, Speech Synthesis | —Unverified | 0 |
| Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis | Mar 23, 2022 | Expressive Speech Synthesis, Knowledge Distillation | —Unverified | 0 |