SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 501525 of 1419 papers

TitleStatusHype
Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody ModellingCode2
Crowdsourced and Automatic Speech Prominence EstimationCode1
On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition0
Prosody Analysis of AudiobooksCode0
Neutral TTS Female Voice Corpus in Brazilian Portuguese0
Unified speech and gesture synthesis using flow matching0
Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset0
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPTCode2
Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis0
The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains0
Towards human-like spoken dialogue generation between AI agents from written dialogue0
Evaluating Speech Synthesis by Training Recognizers on Synthetic SpeechCode1
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS0
Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features0
High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models0
Face-StyleSpeech: Enhancing Zero-shot Speech Synthesis from Face Images with Improved Face-to-Speech Mapping0
BiSinger: Bilingual Singing Voice SynthesisCode1
VoiceLDM: Text-to-Speech with Environmental Context0
DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis0
Emotion-Aware Prosodic Phrasing for Expressive Text-to-SpeechCode1
The Impact of Silence on Speech Anti-Spoofing0
Speak While You Think: Streaming Speech Synthesis During Text Generation0
Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language ModelCode1
Exploring Speech Enhancement for Low-resource Speech Synthesis0
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition0
Show:102550
← PrevPage 21 of 57Next →

No leaderboard results yet.