SOTAVerified

Text to Speech

from gtts import gTTS
import os


def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: gTTS may not list "ku"; supported codes can be checked with gtts.lang.tts_langs().
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)


# Example (the Kurdish text reads: "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
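The snippet above opens the saved file with `start`, which only exists on Windows. A minimal sketch of a portable alternative follows, using only the standard library. The helper names `player_command` and `play` are hypothetical and not part of gTTS, and the fallback branch assumes a desktop system with `xdg-open` installed.

```python
import subprocess
import sys


def player_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string is its window-title argument.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop where xdg-utils provides `xdg-open`.
    return ["xdg-open", path]


def play(path):
    """Open the audio file without building a shell command string."""
    subprocess.run(player_command(path), check=False)
```

Calling `play("output.mp3")` after `tts.save("output.mp3")` would then hand the file to whatever player the operating system associates with MP3 files. Passing the path as a list element rather than interpolating it into a shell string also avoids problems with spaces in file names.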

Papers

Showing 1226–1250 of 1419 papers

Title | Status | Hype
Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? | - | 0
Can we reconstruct a dysarthric voice with the large speech model Parler TTS? | - | 0
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data | - | 0
CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech | - | 0
CASSANDRA: A multipurpose configurable voice-enabled human-computer-interface | - | 0
Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | - | 0
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models | - | 0
Character-Level Bangla Text-to-IPA Transcription Using Transformer Architecture with Sequence Alignment | - | 0
ChatAnything: Facetime Chat with LLM-Enhanced Personas | - | 0
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network | - | 0
CHULA TTS: A Modularized Text-To-Speech Framework | - | 0
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech | - | 0
ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus | - | 0
Clip-TTS: Contrastive Text-content and Mel-spectrogram, A High-Quality Text-to-Speech Method based on Contextual Semantic Understanding | - | 0
CloneShield: A Framework for Universal Perturbation Against Zero-Shot Voice Cloning | - | 0
CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages | - | 0
Code-Mixed Text to Speech Synthesis under Low-Resource Constraints | - | 0
Code-Switching Text Generation and Injection in Mandarin-English ASR | - | 0
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling | - | 0
Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection | - | 0
Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis | - | 0
ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios | - | 0
Compact Neural TTS Voices for Accessibility | - | 0
Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset | - | 0
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech | - | 0
Page 50 of 57

No leaderboard results yet.