SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: gTTS raises ValueError if it does not support the requested language code.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the Kurdish string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
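The snippet above opens the saved file with the Windows-only `start` command. A minimal sketch of a portable alternative follows, assuming `open` is available on macOS and `xdg-open` on Linux desktops. The helper names `build_open_command` and `open_audio_file` are hypothetical and not part of gTTS.

```python
import platform
import subprocess


def build_open_command(path, system=None):
    """Return the argv list that opens `path` with the default application."""
    system = system or platform.system()
    if system == "Windows":
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if system == "Darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio_file(path):
    """Open the audio file, passing the path as an argument rather than a shell string."""
    subprocess.run(build_open_command(path), check=False)
```

Passing the path as a separate list element, instead of interpolating it into an `os.system` string, also keeps file names that contain spaces intact.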

Papers

Showing 1251–1275 of 1419 papers

| Title | Status | Hype |
| --- | --- | --- |
| ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit | Code | 0 |
| Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis | Code | 0 |
| G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR | | 0 |
| The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach | | 0 |
| Semi-Supervised Generative Modeling for Controllable Speech Synthesis | | 0 |
| High Fidelity Speech Synthesis with Adversarial Networks | Code | 0 |
| Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech | | 0 |
| A Comparative Study on Transformer vs RNN in Speech Applications | Code | 0 |
| Modular Meta-Learning with Shrinkage | | 0 |
| Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs | | 0 |
| Neural Network-Based Modeling of Phonetic Durations | | 0 |
| A Large-Scale User Study of an Alexa Prize Chatbot: Effect of TTS Dynamism on Perceived Quality of Social Dialog | | 0 |
| Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments | | 0 |
| Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis | | 0 |
| From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories | | 0 |
| Numbers Normalisation in the Inflected Languages: a Case Study of Polish | Code | 0 |
| MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible | Code | 0 |
| Hierarchical Sequence to Sequence Voice Conversion with Limited Data | | 0 |
| M3D-GAN: Multi-Modal Multi-Domain Translation with Universal Attention | | 0 |
| A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach | | 0 |
| A Novel Approach to OCR using Image Recognition based Classification for Ancient Tamil Inscriptions in Temples | | 0 |
| Fine-grained robust prosody transfer for single-speaker neural text-to-speech | | 0 |
| Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features | | 0 |
| Improving Performance of End-to-End ASR on Numeric Sequences | | 0 |
| An adaptable task-oriented dialog system for stand-alone embedded devices | | 0 |
Page 51 of 57

No leaderboard results yet.