SOTAVerified

Text to Speech

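from gtts import gTTS
import os


def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selected for Kurdish).
    # Caution: "ku" may not be in gTTS's supported language list, in which case
    # gTTS raises ValueError; gtts.lang.tts_langs() lists the available codes.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)


# Example (the Kurdish text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")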
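
The snippet above opens the generated file with `start`, which exists only on Windows. A minimal sketch of a cross-platform alternative follows. The helper names `player_command` and `open_audio` are hypothetical and not part of gTTS, and the fallback branch assumes a Linux or BSD desktop with `xdg-open` installed.

```python
import subprocess
import sys


def player_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe built-in; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio(path):
    # Passing a list (no shell=True) keeps file names with spaces intact.
    subprocess.run(player_command(path), check=False)
```

Calling `open_audio(output_file)` in place of the `os.system` line would keep the rest of the function unchanged.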

Papers

Showing 1226-1250 of 1419 papers

| Title | Status | Hype |
|---|---|---|
| Teacher-Student Training for Robust Tacotron-based TTS | | 0 |
| Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework | | 0 |
| A System for Diacritizing Four Varieties of Arabic | | 0 |
| Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis | Code | 0 |
| Unsupervised pre-training for sequence to sequence speech recognition | | 0 |
| Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment | | 0 |
| Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency | | 0 |
| Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram | Code | 2 |
| ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit | Code | 0 |
| Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis | Code | 0 |
| G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR | | 0 |
| The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach | | 0 |
| Semi-Supervised Generative Modeling for Controllable Speech Synthesis | | 0 |
| High Fidelity Speech Synthesis with Adversarial Networks | Code | 0 |
| Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech | | 0 |
| A Comparative Study on Transformer vs RNN in Speech Applications | Code | 0 |
| Modular Meta-Learning with Shrinkage | | 0 |
| Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs | | 0 |
| Neural Network-Based Modeling of Phonetic Durations | | 0 |
| A Large-Scale User Study of an Alexa Prize Chatbot: Effect of TTS Dynamism on Perceived Quality of Social Dialog | | 0 |
| Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments | | 0 |
| Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis | | 0 |
| From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories | | 0 |
| Numbers Normalisation in the Inflected Languages: a Case Study of Polish | Code | 0 |
| MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible | Code | 0 |
Page 50 of 57

No leaderboard results yet.