| EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector | Nov 4, 2024 | DecoderEmotional Speech Synthesis | CodeCode Available | 2 |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Oct 1, 2024 | Emotional Speech SynthesisSpeech Synthesis | CodeCode Available | 2 |
| EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech | Jun 12, 2024 | Emotional Speech Synthesistext-to-speech | CodeCode Available | 2 |
| OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Jan 8, 2025 | DecoderEmotional Speech Synthesis | CodeCode Available | 2 |
| Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis | Mar 27, 2019 | Emotional Speech SynthesisExpressive Speech Synthesis | CodeCode Available | 1 |
| Exploring Transfer Learning for Low Resource Emotional TTS | Jan 14, 2019 | Deep LearningEmotional Speech Synthesis | CodeCode Available | 1 |
| StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis | Oct 7, 2021 | AttributeData Augmentation | CodeCode Available | 1 |
| ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis | Jan 16, 2024 | DenoisingEmotional Speech Synthesis | —Unverified | 0 |
| DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech | May 26, 2025 | AttributeEmotional Speech Synthesis | —Unverified | 0 |
| ASR-based Features for Emotion Recognition: A Transfer Learning Approach | May 23, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |