SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 201225 of 1419 papers

TitleStatusHype
Benchmarking Large Multimodal Models against Common CorruptionsCode1
Imaginary Voice: Face-styled Diffusion Model for Text-to-SpeechCode1
Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language TextCode1
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided AttentionCode1
OverFlow: Putting flows on top of neural transducers for better TTSCode1
BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpusCode1
Effective Deep Learning Models for Automatic Diacritization of Arabic TextCode1
BiSinger: Bilingual Singing Voice SynthesisCode1
EdiTTS: Score-based Editing for Controllable Text-to-SpeechCode1
ÌròyìnSpeech: A multi-purpose Yorùbá Speech CorpusCode1
One-class learning towards generalized voice spoofing detectionCode1
Attention model for articulatory features detectionCode1
ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS AdaptationCode1
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis DatasetCode1
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novelsCode1
One Model, Many Languages: Meta-learning for Multilingual Text-to-SpeechCode1
Learning to Dub Movies via Hierarchical Prosody ModelsCode1
Dreamento: an open-source dream engineering toolbox for sleep EEG wearablesCode1
Brilla AI: AI Contestant for the National Science and Maths QuizCode1
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text PretrainingCode1
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTSCode1
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional FusionCode1
DiffProsody: Diffusion-based Latent Prosody Generation for Expressive Speech Synthesis with Prosody Conditional Adversarial TrainingCode1
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-SpeechCode1
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found DataCode1
Show:102550
← PrevPage 9 of 57Next →

No leaderboard results yet.