SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 951975 of 1419 papers

TitleStatusHype
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading0
Contextual Expressive Text-to-Speech0
Continual Learning in Machine Speech Chain Using Gradient Episodic Memory0
Continual Speaker Adaptation for Text-to-Speech Synthesis0
Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations0
Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM0
Continuous Speech Synthesis using per-token Latent Diffusion0
Controllable Accented Text-to-Speech Synthesis0
Controllable Emphasis with zero data for text-to-speech0
Controllable neural text-to-speech synthesis using intuitive prosodic features0
Controllable speech synthesis by learning discrete phoneme-level prosodic representations0
Controlling Emotion in Text-to-Speech with Natural Language Prompts0
Controllable Prosody Generation With Partial Inputs0
Controlling Prosody in End-to-End TTS: A Case Study on Contrastive Focus Generation0
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech0
Corpus Generation for Voice Command in Smart Home and the Effect of Speech Synthesis on End-to-End SLU0
Counterfactual Activation Editing for Post-hoc Prosody and Mispronunciation Correction in TTS Models0
Learning Speech Representation From Contrastive Token-Acoustic Pretraining0
Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations0
Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform0
Creating New Voices using Normalizing Flows0
Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT0
Cross-Domain Audio Deepfake Detection: Dataset and Analysis0
Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech0
Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers0
Show:102550
← PrevPage 39 of 57Next →

No leaderboard results yet.