SOTAVerified

Expressive Speech Synthesis

Papers

Showing 125 of 47 papers

TitleStatusHype
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained HubertCode4
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novelsCode1
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-SpeechCode1
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with TacotronCode1
Laughter Synthesis: Combining Seq2seq modeling with Transfer LearningCode1
Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource SettingsCode1
Articulatory Phonetics Informed Controllable Expressive Speech SynthesisCode1
Exploring Transfer Learning for Low Resource Emotional TTSCode1
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial TrainingCode1
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech SynthesisCode1
SC VALL-E: Style-Controllable Zero-Shot Text to Speech SynthesizerCode1
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio AnalysisCode1
Robust and fine-grained prosody control of end-to-end speech synthesisCode0
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis0
Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling0
Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning0
NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech0
On granularity of prosodic representations in expressive text-to-speech0
Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis0
Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions0
RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations0
Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis0
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis0
Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System0
Speech Synthesis along Perceptual Voice Quality Dimensions0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.