| Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert | Apr 18, 2023 | Audio GenerationExpressive Speech Synthesis | CodeCode Available | 4 | 5 |
| Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings | Jul 19, 2024 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 | 5 |
| DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training | Jul 31, 2023 | DenoisingExpressive Speech Synthesis | CodeCode Available | 1 | 5 |
| EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels | May 22, 2023 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 | 5 |
| Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech | Oct 8, 2021 | Emotion InterpretationExpressive Speech Synthesis | CodeCode Available | 1 | 5 |
| Laughter Synthesis: Combining Seq2seq modeling with Transfer Learning | Aug 20, 2020 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 | 5 |
| Articulatory Phonetics Informed Controllable Expressive Speech Synthesis | Jun 15, 2024 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 | 5 |
| Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis | Jun 8, 2019 | Expressive Speech SynthesisSpeech Synthesis | CodeCode Available | 1 | 5 |
| Exploring Transfer Learning for Low Resource Emotional TTS | Jan 14, 2019 | Deep LearningEmotional Speech Synthesis | CodeCode Available | 1 | 5 |
| SC VALL-E: Style-Controllable Zero-Shot Text to Speech Synthesizer | Jul 20, 2023 | Expressive Speech SynthesisLanguage Modelling | CodeCode Available | 1 | 5 |