SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 12761300 of 1419 papers

TitleStatusHype
Non-Autoregressive Neural Text-to-SpeechCode0
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systemsCode0
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network0
Almost Unsupervised Text to Speech and Automatic Speech Recognition0
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text0
The Zero Resource Speech Challenge 2019: TTS without T0
Expediting TTS Synthesis with Adversarial VocodingCode0
End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning0
Building a mixed-lingual neural TTS system with only monolingual data0
Direct speech-to-speech translation with a sequence-to-sequence modelCode0
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogramCode0
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion0
In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited DataCode1
Speech denoising by parametric resynthesis0
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworksCode0
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora0
Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet0
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio AnalysisCode1
CSS10: A Collection of Single Speaker Speech Datasets for 10 LanguagesCode0
Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis0
Deep Text-to-Speech System with Seq2Seq Model0
Data Efficient Voice Cloning for Neural Singing Synthesis0
End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition ModelCode1
Unsupervised Polyglot Text To Speech0
Hand Sign to Bangla Speech: A Deep Learning in Vision based system for Recognizing Hand Sign Digits and Generating Bangla Speech0
Show:102550
← PrevPage 52 of 57Next →

No leaderboard results yet.