SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 451475 of 1419 papers

TitleStatusHype
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights0
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition0
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model0
EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting0
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems0
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems0
Emphasis control for parallel neural TTS0
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person0
Emphasized Accent Phrase Prediction from Text for Advertisement Text-To-Speech Synthesis0
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition0
ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering0
Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis0
ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams0
Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech20
Efficient training strategies for natural sounding speech synthesis and speaker adaptation based on FastPitch0
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator0
Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech0
Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck0
Adversarial speech for voice privacy protection from Personalized Speech generation0
End-to-end speech recognition modeling from de-identified data0
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue0
End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning0
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE0
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation0
A Comparative Analysis of Pretrained Language Models for Text-to-Speech0
Show:102550
← PrevPage 19 of 57Next →

No leaderboard results yet.