| EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech | Jun 12, 2024 | Emotional Speech Synthesistext-to-speech | CodeCode Available | 2 | 5 |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Oct 1, 2024 | Emotional Speech SynthesisSpeech Synthesis | CodeCode Available | 2 | 5 |
| EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector | Nov 4, 2024 | DecoderEmotional Speech Synthesis | CodeCode Available | 2 | 5 |
| OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Jan 8, 2025 | DecoderEmotional Speech Synthesis | CodeCode Available | 2 | 5 |
| StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis | Oct 7, 2021 | AttributeData Augmentation | CodeCode Available | 1 | 5 |
| Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis | Mar 27, 2019 | Emotional Speech SynthesisExpressive Speech Synthesis | CodeCode Available | 1 | 5 |
| Exploring Transfer Learning for Low Resource Emotional TTS | Jan 14, 2019 | Deep LearningEmotional Speech Synthesis | CodeCode Available | 1 | 5 |
| Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization | Sep 16, 2024 | Emotional Speech SynthesisIn-Context Learning | —Unverified | 0 | 0 |
| EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model | Jun 17, 2021 | Emotional Speech SynthesisEmotion Classification | —Unverified | 0 | 0 |
| EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System | Jun 26, 2018 | Emotional Speech SynthesisParameter Prediction | —Unverified | 0 | 0 |
| UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech | May 15, 2025 | Emotional Speech SynthesisLanguage Modeling | —Unverified | 0 | 0 |
| Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech | Sep 24, 2024 | Emotional Speech SynthesisSpeech Synthesis | —Unverified | 0 | 0 |
| GANtron: Emotional Speech Synthesis with Generative Adversarial Networks | Oct 6, 2021 | Emotional Speech SynthesisSpeech Synthesis | —Unverified | 0 | 0 |
| Multi-stream Attention-based BLSTM with Feature Segmentation for Speech Emotion Recognition | Oct 25, 2020 | Data AugmentationEmotional Speech Synthesis | —Unverified | 0 | 0 |
| Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis | Oct 28, 2022 | DecoderDiversity | —Unverified | 0 | 0 |
| QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis | Mar 14, 2023 | Emotional Speech SynthesisSentence | —Unverified | 0 | 0 |
| Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System | Dec 1, 2020 | ArticlesEmotional Speech Synthesis | —Unverified | 0 | 0 |
| Speech Synthesis with Mixed Emotions | Aug 11, 2022 | AttributeEmotional Speech Synthesis | —Unverified | 0 | 0 |
| Versatile Speech Databases for High Quality Synthesis for Basque | May 1, 2012 | Emotional Speech SynthesisSpeech Synthesis | —Unverified | 0 | 0 |
| End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training | Jun 26, 2019 | Emotional Speech SynthesisEmotion Recognition | —Unverified | 0 | 0 |
| An Analysis of the Effect of Emotional Speech Synthesis on Non-Task-Oriented Dialogue System | Jul 1, 2018 | Dialogue ManagementEmotional Speech Synthesis | —Unverified | 0 | 0 |
| ASR-based Features for Emotion Recognition: A Transfer Learning Approach | May 23, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations | Nov 11, 2022 | Emotional Speech SynthesisSpeech Synthesis | —Unverified | 0 | 0 |
| Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis | Jul 30, 2018 | Acoustic ModellingDecoder | —Unverified | 0 | 0 |
| DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech | May 26, 2025 | AttributeEmotional Speech Synthesis | —Unverified | 0 | 0 |
| ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis | Jan 16, 2024 | DenoisingEmotional Speech Synthesis | —Unverified | 0 | 0 |