SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 251275 of 1419 papers

TitleStatusHype
FastSpeech 2: Fast and High-Quality End-to-End Text to SpeechCode1
End-to-End Adversarial Text-to-SpeechCode1
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment SearchCode1
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech SynthesisCode1
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback ConstraintCode1
Transformer based Grapheme-to-Phoneme ConversionCode1
g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark DatasetCode1
Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0Code1
Semi-Supervised Neural Architecture SearchCode1
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech PretrainingCode1
Attention model for articulatory features detectionCode1
In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited DataCode1
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio AnalysisCode1
End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition ModelCode1
Robust universal neural vocodingCode1
ClariNet: Parallel Wave Generation in End-to-End Text-to-SpeechCode1
Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language TextCode1
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided AttentionCode1
VoiceLoop: Voice Fitting and Synthesis via a Phonological LoopCode1
Tacotron: Towards End-to-End Speech SynthesisCode1
WaveNet: A Generative Model for Raw AudioCode1
Hear Your Code Fail, Voice-Assisted Debugging for Python0
NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech0
P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
Show:102550
← PrevPage 11 of 57Next →

No leaderboard results yet.