Expressive Speech Synthesis

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–47 of 47 papers

Title	Date	Tasks	Status	Hype
NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech	Jul 17, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions	Jun 3, 2025	Expressive Speech SynthesisPrompt Learning	—Unverified	0
RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations	May 24, 2025	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Gender Bias in Instruction-Guided Speech Synthesis Models	Feb 8, 2025	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Speech Synthesis along Perceptual Voice Quality Dimensions	Jan 15, 2025	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis	Jul 19, 2024	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings	Jul 19, 2024	Expressive Speech SynthesisSpeech Synthesis	CodeCode Available	1
Articulatory Phonetics Informed Controllable Expressive Speech Synthesis	Jun 15, 2024	Expressive Speech SynthesisSpeech Synthesis	CodeCode Available	1
Expressivity and Speech Synthesis	Apr 30, 2024	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning	Oct 26, 2023	Contrastive LearningExpressive Speech Synthesis	—Unverified	0
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis	Aug 31, 2023	Expressive Speech SynthesisSentence	—Unverified	0
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training	Jul 31, 2023	DenoisingExpressive Speech Synthesis	CodeCode Available	1
SC VALL-E: Style-Controllable Zero-Shot Text to Speech Synthesizer	Jul 20, 2023	Expressive Speech SynthesisLanguage Modelling	CodeCode Available	1
Cross-lingual Prosody Transfer for Expressive Machine Dubbing	Jun 20, 2023	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels	May 22, 2023	Expressive Speech SynthesisSpeech Synthesis	CodeCode Available	1
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert	Apr 18, 2023	Audio GenerationExpressive Speech Synthesis	CodeCode Available	4
Ensemble prosody prediction for expressive speech synthesis	Apr 3, 2023	DiversityEnsemble Learning	—Unverified	0
On granularity of prosodic representations in expressive text-to-speech	Jan 26, 2023	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling	Nov 19, 2022	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis	Nov 2, 2022	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis	Nov 1, 2022	DisentanglementDiversity	—Unverified	0
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis	Jun 25, 2022	Contrastive LearningDeep Clustering	—Unverified	0
Fine-grained Noise Control for Multispeaker Speech Synthesis	Apr 11, 2022	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis	Mar 23, 2022	Expressive Speech SynthesisKnowledge Distillation	—Unverified	0
Word-Level Style Control for Expressive, Non-attentive Speech Synthesis	Nov 19, 2021	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech	Oct 8, 2021	Emotion InterpretationExpressive Speech Synthesis	CodeCode Available	1
Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis	Sep 8, 2021	Expressive Speech SynthesisSentence	—Unverified	0
Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis	Jul 27, 2021	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
UniTTS: Residual Learning of Unified Embedding Space for Speech Style Control	Jun 21, 2021	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Towards Multi-Scale Style Control for Expressive Speech Synthesis	Apr 8, 2021	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System	Dec 1, 2020	ArticlesEmotional Speech Synthesis	—Unverified	0
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis	Sep 17, 2020	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Laughter Synthesis: Combining Seq2seq modeling with Transfer Learning	Aug 20, 2020	Expressive Speech SynthesisSpeech Synthesis	CodeCode Available	1
Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech	Nov 28, 2019	DisentanglementExpressive Speech Synthesis	—Unverified	0
The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach	Oct 14, 2019	Expressive Speech SynthesisSociology	—Unverified	0
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis	Jun 8, 2019	Expressive Speech SynthesisSpeech Synthesis	CodeCode Available	1
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis	Mar 27, 2019	Emotional Speech SynthesisExpressive Speech Synthesis	CodeCode Available	1
Exploring Transfer Learning for Low Resource Emotional TTS	Jan 14, 2019	Deep LearningEmotional Speech Synthesis	CodeCode Available	1
Robust and fine-grained prosody control of end-to-end speech synthesis	Nov 6, 2018	Expressive Speech SynthesisSpeech Synthesis	CodeCode Available	0
SynPaFlex-Corpus: An Expressive French Audiobooks Corpus dedicated to expressive speech synthesis.	May 1, 2018	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder	Apr 6, 2018	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron	Mar 24, 2018	Expressive Speech SynthesisSpeech Synthesis	CodeCode Available	1
Uncovering Latent Style Factors for Expressive Speech Synthesis	Nov 1, 2017	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM	Dec 1, 2016	Expressive Speech SynthesisSpeech Recognition	—Unverified	0
Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis	May 1, 2016	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
Alert!... Calm Down, There is Nothing to Worry About. Warning and Soothing Speech Synthesis.	May 1, 2014	Expressive Speech SynthesisSentence	—Unverified	0
Evaluating expressive speech synthesis from audiobook corpora for conversational phrases	May 1, 2012	ClusteringExpressive Speech Synthesis	—Unverified	0

Show:10 25 50

No leaderboard results yet.