SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # gTTS validates the language code and raises ValueError if it is not supported.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
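The snippet above opens the saved file with `start`, which exists only in the Windows command shell. A minimal sketch of a portable alternative follows, using only the standard library. The helper names `open_command` and `open_file` are hypothetical, and the sketch assumes `open` is available on macOS and `xdg-open` on other Unix-like desktops.

```python
import subprocess
import sys


def open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default app.

    `platform` follows the values of sys.platform ("win32", "darwin", "linux", ...).
    """
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string is its window-title argument.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a desktop environment that provides xdg-open.
    return ["xdg-open", path]


def open_file(path):
    """Launch the default application for `path` without going through a shell string."""
    subprocess.run(open_command(path), check=False)
```

Calling `open_file(output_file)` in place of the `os.system(...)` line would keep the rest of the function unchanged. Passing an argument list rather than a formatted shell string also avoids problems with file names that contain spaces.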

Papers

Showing 251-275 of 1419 papers

Title | Status | Hype
Dreamento: an open-source dream engineering toolbox for sleep EEG wearables | Code | 1
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS | Code | 1
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels | Code | 1
Pretraining Techniques for Sequence-to-Sequence Voice Conversion | Code | 1
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model | Code | 1
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech | Code | 1
Developing multilingual speech synthesis system for Ojibwe, Mi'kmaq, and Maliseet | Code | 1
RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis | Code | 1
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech | Code | 1
Deep Learning Based Assessment of Synthetic Speech Naturalness | Code | 1
Robust universal neural vocoding | Code | 1
RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis | Code | 1
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech | Code | 1
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech | Code | 1
ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus | Code | 1
Crowdsourced and Automatic Speech Prominence Estimation | Code | 1
ArTST: Arabic Text and Speech Transformer | Code | 1
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial Training | Code | 1
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data | Code | 1
Semi-Supervised Neural Architecture Search | Code | 1
Text + Sketch: Image Compression at Ultra Low Rates | Code | 1
Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022 | Code | 0
PromptTTS: Controllable Text-to-Speech with Text Descriptions | Code | 0
PolyGlotFake: A Novel Multilingual and Multimodal DeepFake Dataset | Code | 0
Predicting distributions with Linearizing Belief Networks | Code | 0

No leaderboard results yet.