SOTAVerified

Expressive Speech Synthesis

Papers

Showing 125 of 47 papers

TitleStatusHype
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained HubertCode4
Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource SettingsCode1
Articulatory Phonetics Informed Controllable Expressive Speech SynthesisCode1
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial TrainingCode1
SC VALL-E: Style-Controllable Zero-Shot Text to Speech SynthesizerCode1
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novelsCode1
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-SpeechCode1
Laughter Synthesis: Combining Seq2seq modeling with Transfer LearningCode1
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech SynthesisCode1
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio AnalysisCode1
Exploring Transfer Learning for Low Resource Emotional TTSCode1
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with TacotronCode1
NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech0
Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions0
RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations0
Gender Bias in Instruction-Guided Speech Synthesis Models0
Speech Synthesis along Perceptual Voice Quality Dimensions0
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis0
Expressivity and Speech Synthesis0
Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning0
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis0
Cross-lingual Prosody Transfer for Expressive Machine Dubbing0
Ensemble prosody prediction for expressive speech synthesis0
On granularity of prosodic representations in expressive text-to-speech0
Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.