SOTAVerified

Expressive Speech Synthesis

Papers

Showing 125 of 47 papers

TitleStatusHype
NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech0
Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions0
RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations0
Gender Bias in Instruction-Guided Speech Synthesis Models0
Speech Synthesis along Perceptual Voice Quality Dimensions0
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis0
Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource SettingsCode1
Articulatory Phonetics Informed Controllable Expressive Speech SynthesisCode1
Expressivity and Speech Synthesis0
Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning0
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis0
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial TrainingCode1
SC VALL-E: Style-Controllable Zero-Shot Text to Speech SynthesizerCode1
Cross-lingual Prosody Transfer for Expressive Machine Dubbing0
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novelsCode1
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained HubertCode4
Ensemble prosody prediction for expressive speech synthesis0
On granularity of prosodic representations in expressive text-to-speech0
Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling0
Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis0
Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis0
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis0
Fine-grained Noise Control for Multispeaker Speech Synthesis0
Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis0
Word-Level Style Control for Expressive, Non-attentive Speech Synthesis0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.