SOTAVerified

Expressive Speech Synthesis

Papers

Showing 125 of 47 papers

TitleStatusHype
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained HubertCode4
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech SynthesisCode1
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novelsCode1
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-SpeechCode1
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with TacotronCode1
Laughter Synthesis: Combining Seq2seq modeling with Transfer LearningCode1
Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource SettingsCode1
Articulatory Phonetics Informed Controllable Expressive Speech SynthesisCode1
Exploring Transfer Learning for Low Resource Emotional TTSCode1
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio AnalysisCode1
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial TrainingCode1
SC VALL-E: Style-Controllable Zero-Shot Text to Speech SynthesizerCode1
Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis0
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis0
Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling0
Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning0
NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech0
On granularity of prosodic representations in expressive text-to-speech0
Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis0
Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions0
RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations0
Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis0
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis0
Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System0
Speech Synthesis along Perceptual Voice Quality Dimensions0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.