SOTAVerified

Emotional Speech Synthesis

Papers

Showing 1–25 of 26 papers

| Title | Status | Hype |
| --- | --- | --- |
| EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech | Code | 2 |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Code | 2 |
| EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector | Code | 2 |
| OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Code | 2 |
| StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis | Code | 1 |
| Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis | Code | 1 |
| Exploring Transfer Learning for Low Resource Emotional TTS | Code | 1 |
| Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization | | 0 |
| EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model | | 0 |
| EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System | | 0 |
| UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech | | 0 |
| Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech | | 0 |
| GANtron: Emotional Speech Synthesis with Generative Adversarial Networks | | 0 |
| Multi-stream Attention-based BLSTM with Feature Segmentation for Speech Emotion Recognition | | 0 |
| Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis | | 0 |
| QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis | | 0 |
| Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System | | 0 |
| Speech Synthesis with Mixed Emotions | | 0 |
| Versatile Speech Databases for High Quality Synthesis for Basque | | 0 |
| End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training | | 0 |
| An Analysis of the Effect of Emotional Speech Synthesis on Non-Task-Oriented Dialogue System | | 0 |
| ASR-based Features for Emotion Recognition: A Transfer Learning Approach | | 0 |
| Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations | | 0 |
| Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis | | 0 |
| DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech | | 0 |

No leaderboard results yet.