SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 12511275 of 1419 papers

TitleStatusHype
Hierarchical Sequence to Sequence Voice Conversion with Limited Data0
M3D-GAN: Multi-Modal Multi-Domain Translation with Universal Attention0
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice CloningCode3
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach0
A Novel Approach to OCR using Image Recognition based Classification for Ancient Tamil Inscriptions in Temples0
Fine-grained robust prosody transfer for single-speaker neural text-to-speech0
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features0
Attention model for articulatory features detectionCode1
An adaptable task-oriented dialog system for stand-alone embedded devices0
Improving Performance of End-to-End ASR on Numeric Sequences0
RUSLAN: Russian Spoken Language Corpus for Speech Synthesis0
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models0
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling0
Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise0
Using generative modelling to produce varied intonation for speech synthesisCode0
Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods0
MelNet: A Generative Model for Audio in the Frequency DomainCode0
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain0
Neural Models of Text Normalization for Speech Applications0
Customizing Grapheme-to-Phoneme System for Non-Trivial Transcription Problems in Bangla Language0
Highly Effective Arabic Diacritization using Sequence to Sequence Modeling0
Neural Text Normalization with Subword Units0
A Cost Efficient Approach to Correct OCR Errors in Large Document Collections0
FastSpeech: Fast,Robustand Controllable Text-to-SpeechCode2
FastSpeech: Fast, Robust and Controllable Text to SpeechCode2
Show:102550
← PrevPage 51 of 57Next →

No leaderboard results yet.