| Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert | Apr 18, 2023 | Audio GenerationExpressive Speech Synthesis | CodeCode Available | 4 |
| Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis | Jun 8, 2019 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 |
| EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels | May 22, 2023 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 |
| Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech | Oct 8, 2021 | Emotion InterpretationExpressive Speech Synthesis | CodeCode Available | 1 |
| Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron | Mar 24, 2018 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 |
| Laughter Synthesis: Combining Seq2seq modeling with Transfer Learning | Aug 20, 2020 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 |
| Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings | Jul 19, 2024 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 |
| Articulatory Phonetics Informed Controllable Expressive Speech Synthesis | Jun 15, 2024 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 |
| Exploring Transfer Learning for Low Resource Emotional TTS | Jan 14, 2019 | Deep LearningEmotional Speech Synthesis | CodeCode Available | 1 |
| Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis | Mar 27, 2019 | Emotional Speech SynthesisExpressive Speech Synthesis | CodeCode Available | 1 |
| DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training | Jul 31, 2023 | DenoisingExpressive Speech Synthesis | CodeCode Available | 1 |
| SC VALL-E: Style-Controllable Zero-Shot Text to Speech Synthesizer | Jul 20, 2023 | Expressive Speech SynthesisLanguage Modelling | CodeCode Available | 1 |
| Learning utterance-level representations through token-level acoustic latents prediction for Expressive Speech Synthesis | Nov 1, 2022 | DisentanglementDiversity | —Unverified | 0 |
| MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis | Jul 19, 2024 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Multi-Speaker Expressive Speech Synthesis via Multiple Factors Decoupling | Nov 19, 2022 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning | Oct 26, 2023 | Contrastive LearningExpressive Speech Synthesis | —Unverified | 0 |
| NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech | Jul 17, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| On granularity of prosodic representations in expressive text-to-speech | Jan 26, 2023 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Predicting phoneme-level prosody latents using AR and flow-based Prior Networks for expressive speech synthesis | Nov 2, 2022 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions | Jun 3, 2025 | Expressive Speech SynthesisPrompt Learning | —Unverified | 0 |
| RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations | May 24, 2025 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis | Sep 8, 2021 | Expressive Speech SynthesisSentence | —Unverified | 0 |
| Self-supervised Context-aware Style Representation for Expressive Speech Synthesis | Jun 25, 2022 | Contrastive LearningDeep Clustering | —Unverified | 0 |
| Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System | Dec 1, 2020 | ArticlesEmotional Speech Synthesis | —Unverified | 0 |
| Speech Synthesis along Perceptual Voice Quality Dimensions | Jan 15, 2025 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| SynPaFlex-Corpus: An Expressive French Audiobooks Corpus dedicated to expressive speech synthesis. | May 1, 2018 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach | Oct 14, 2019 | Expressive Speech SynthesisSociology | —Unverified | 0 |
| Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis | Mar 23, 2022 | Expressive Speech SynthesisKnowledge Distillation | —Unverified | 0 |
| Towards Multi-Scale Style Control for Expressive Speech Synthesis | Apr 8, 2021 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis | Aug 31, 2023 | Expressive Speech SynthesisSentence | —Unverified | 0 |
| Uncovering Latent Style Factors for Expressive Speech Synthesis | Nov 1, 2017 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| UniTTS: Residual Learning of Unified Embedding Space for Speech Style Control | Jun 21, 2021 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech | Nov 28, 2019 | DisentanglementExpressive Speech Synthesis | —Unverified | 0 |
| Word-Level Style Control for Expressive, Non-attentive Speech Synthesis | Nov 19, 2021 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis | May 1, 2016 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM | Dec 1, 2016 | Expressive Speech SynthesisSpeech Recognition | —Unverified | 0 |
| Cross-lingual Prosody Transfer for Expressive Machine Dubbing | Jun 20, 2023 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis | Jul 27, 2021 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Ensemble prosody prediction for expressive speech synthesis | Apr 3, 2023 | DiversityEnsemble Learning | —Unverified | 0 |
| Evaluating expressive speech synthesis from audiobook corpora for conversational phrases | May 1, 2012 | ClusteringExpressive Speech Synthesis | —Unverified | 0 |
| Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder | Apr 6, 2018 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Expressivity and Speech Synthesis | Apr 30, 2024 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Fine-grained Noise Control for Multispeaker Speech Synthesis | Apr 11, 2022 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Gender Bias in Instruction-Guided Speech Synthesis Models | Feb 8, 2025 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis | Sep 17, 2020 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Alert!... Calm Down, There is Nothing to Worry About. Warning and Soothing Speech Synthesis. | May 1, 2014 | Expressive Speech SynthesisSentence | —Unverified | 0 |
| Robust and fine-grained prosody control of end-to-end speech synthesis | Nov 6, 2018 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 0 |