SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selected for Kurdish).
    # Assumes the installed gTTS version supports "ku"; check gtts.lang.tts_langs().
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example:
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 226-250 of 1419 papers

Title | Status | Hype
Attention model for articulatory features detection | Code | 1
Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation | Code | 1
ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation | Code | 1
OverFlow: Putting flows on top of neural transducers for better TTS | Code | 1
MathReader : Text-to-Speech for Mathematical Documents | Code | 1
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning | Code | 1
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion | Code | 1
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS | Code | 1
EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to Speech | Code | 1
Dreamento: an open-source dream engineering toolbox for sleep EEG wearables | Code | 1
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech | Code | 1
EdiTTS: Score-based Editing for Controllable Text-to-Speech | Code | 1
MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline | Code | 1
PRESENT: Zero-Shot Text-to-Prosody Control | Code | 1
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data | Code | 1
Can we use Common Voice to train a Multi-Speaker TTS system? | Code | 1
One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech | Code | 1
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training | Code | 1
Effective Deep Learning Models for Automatic Diacritization of Arabic Text | Code | 1
Pretraining Techniques for Sequence-to-Sequence Voice Conversion | Code | 1
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech | Code | 1
Phonological Features for 0-shot Multilingual Speech Synthesis | Code | 1
A Survey on Neural Speech Synthesis | Code | 1
Developing multilingual speech synthesis system for Ojibwe, Mi'kmaq, and Maliseet | Code | 1
Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0 | Code | 1

No leaderboard results yet.