SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 101125 of 1419 papers

TitleStatusHype
Differentiable Reward Optimization for LLM based TTS systemCode2
StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech SynthesisCode2
Generative Modeling for Low Dimensional Speech Attributes with Neural Spline FlowsCode2
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier TransformCode2
TTSDS -- Text-to-Speech Distribution ScoreCode2
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific FactorsCode2
Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech SynthesisCode2
End to End Lip Synchronization with a Temporal AutoEncoderCode1
Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style ConversionCode1
ÌròyìnSpeech: A multi-purpose Yorùbá Speech CorpusCode1
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech SystemsCode1
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to SpeechCode1
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis DatasetCode1
EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to SpeechCode1
Improving fairness for spoken language understanding in atypical speech with Text-to-SpeechCode1
Improving TTS for Shanghainese: Addressing Tone Sandhi via Word SegmentationCode1
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novelsCode1
ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic FeaturesCode1
End-to-End Adversarial Text-to-SpeechCode1
Emotion-Aware Prosodic Phrasing for Expressive Text-to-SpeechCode1
EfficientSpeech: An On-Device Text to Speech ModelCode1
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnetCode1
In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited DataCode1
End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition ModelCode1
KazEmoTTS: A Dataset for Kazakh Emotional Text-to-Speech SynthesisCode1
Show:102550
← PrevPage 5 of 57Next →

No leaderboard results yet.