| XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model | Jun 7, 2024 | text-to-speechText to Speech | CodeCode Available | 1 |
| Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS | May 28, 2023 | Diversitytext-to-speech | CodeCode Available | 1 |
| YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone | Dec 4, 2021 | Speech SynthesisText-To-Speech Synthesis | CodeCode Available | 1 |
| An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS | Jun 25, 2025 | Speaker Recognitiontext-to-speech | —Unverified | 0 |
| kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech | Aug 20, 2024 | RetrievalSelf-Supervised Learning | —Unverified | 0 |
| Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech | Aug 28, 2023 | Domain Generalizationtext-to-speech | —Unverified | 0 |
| Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis | Jun 6, 2023 | Neural Renderingtext-to-speech | —Unverified | 0 |
| Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus | Mar 29, 2022 | text-to-speechText to Speech | —Unverified | 0 |