SOTAVerified

Emotional Speech Synthesis

Papers

Showing 1-26 of 26 papers

| Title | Status | Hype |
| --- | --- | --- |
| OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Code | 2 |
| EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector | Code | 2 |
| EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control | Code | 2 |
| EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech | Code | 2 |
| StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis | Code | 1 |
| Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis | Code | 1 |
| Exploring Transfer Learning for Low Resource Emotional TTS | Code | 1 |
| DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech | | 0 |
| UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech | | 0 |
| Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech | | 0 |
| Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization | | 0 |
| ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis | | 0 |
| QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis | | 0 |
| Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations | | 0 |
| Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis | | 0 |
| Speech Synthesis with Mixed Emotions | | 0 |
| GANtron: Emotional Speech Synthesis with Generative Adversarial Networks | | 0 |
| EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model | | 0 |
| Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System | | 0 |
| Multi-stream Attention-based BLSTM with Feature Segmentation for Speech Emotion Recognition | | 0 |
| End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training | | 0 |
| Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis | | 0 |
| An Analysis of the Effect of Emotional Speech Synthesis on Non-Task-Oriented Dialogue System | | 0 |
| EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System | | 0 |
| ASR-based Features for Emotion Recognition: A Transfer Learning Approach | | 0 |
| Versatile Speech Databases for High Quality Synthesis for Basque | | 0 |

No leaderboard results yet.