SOTAVerified

Emotional Speech Synthesis

Papers

Showing 1–25 of 26 papers

| Title | Status | Hype |
| --- | --- | --- |
| DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech | | 0 |
| UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech | | 0 |
| OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Code | 2 |
| EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector | Code | 2 |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Code | 2 |
| Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech | | 0 |
| Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization | | 0 |
| EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech | Code | 2 |
| ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis | | 0 |
| QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis | | 0 |
| Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations | | 0 |
| Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis | | 0 |
| Speech Synthesis with Mixed Emotions | | 0 |
| StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis | Code | 1 |
| GANtron: Emotional Speech Synthesis with Generative Adversarial Networks | | 0 |
| EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model | | 0 |
| Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System | | 0 |
| Multi-stream Attention-based BLSTM with Feature Segmentation for Speech Emotion Recognition | | 0 |
| End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training | | 0 |
| Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis | Code | 1 |
| Exploring Transfer Learning for Low Resource Emotional TTS | Code | 1 |
| Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis | | 0 |
| An Analysis of the Effect of Emotional Speech Synthesis on Non-Task-Oriented Dialogue System | | 0 |
| EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System | | 0 |
| ASR-based Features for Emotion Recognition: A Transfer Learning Approach | | 0 |

No leaderboard results yet.