SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" chosen for Kurdish).
    # Assumes "ku" is a language code that gTTS accepts.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (the Kurdish input means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
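
The snippet above opens the saved file with the Windows-only "start" shell command. A minimal sketch of a portable alternative follows, assuming a desktop environment with a default audio player. The helper names build_open_command and open_audio_file are hypothetical and not part of gTTS.

```python
import os
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default app, or None on Windows."""
    if platform.startswith("win"):
        return None  # no argv form is used on Windows; os.startfile handles it below
    if platform == "darwin":
        return ["open", path]  # macOS launcher
    return ["xdg-open", path]  # assumption: Linux/BSD desktop with xdg-utils installed


def open_audio_file(path):
    """Open `path` in the default player for the current platform."""
    command = build_open_command(path)
    if command is None:
        os.startfile(path)  # standard-library function available only on Windows
    else:
        subprocess.run(command, check=False)
```

Under these assumptions, calling open_audio_file(output_file) could stand in for the os.system line. Passing the path as a list element also avoids shell quoting problems with file names that contain spaces.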

Papers

Showing 1126-1150 of 1419 papers

| Title | Status | Hype |
| --- | --- | --- |
| Denoising Text to Speech with Frame-Level Noise Modeling | | 0 |
| Syntactic representation learning for neural network based TTS with syntactic parse tree traversal | | 0 |
| Using previous acoustic context to improve Text-to-Speech synthesis | | 0 |
| MLS: A Large-Scale Multilingual Dataset for Speech Research | Code | 0 |
| Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment | Code | 0 |
| Text-to-speech for the hearing impaired | | 0 |
| GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis | | 0 |
| Vietnamese Text-To-Speech Shared Task VLSP 2020: Remaining problems with state-of-the-art techniques | | 0 |
| Improving prosodic phrasing of Vietnamese text-to-speech systems | | 0 |
| Development of Smartcall Vietnamese Text-to-Speech for VLSP 2020 | | 0 |
| Bootstrap an end-to-end ASR system by multilingual training, transfer learning, text-to-text mapping and synthetic audio | | 0 |
| FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge | | 0 |
| Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech | | 0 |
| Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems | | 0 |
| Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet Vocoder | Code | 0 |
| Deep Shallow Fusion for RNN-T Personalization | | 0 |
| Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis | | 0 |
| Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement | | 0 |
| Low-resource expressive text-to-speech using data augmentation | | 0 |
| Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS | | 0 |
| Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement | | 0 |
| Naturalization of Text by the Insertion of Pauses and Filler Words | Code | 0 |
| Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis | | 0 |
| Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech | | 0 |
| Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time | | 0 |
Page 46 of 57

No leaderboard results yet.