SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 10761100 of 1419 papers

TitleStatusHype
Unified Mandarin TTS Front-end Based on Distilled BERT ModelCode1
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention0
Denoising Text to Speech with Frame-Level Noise Modeling0
Parallel WaveNet conditioned on VAE latent vectors0
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal0
Using previous acoustic context to improve Text-to-Speech synthesis0
MLS: A Large-Scale Multilingual Dataset for Speech ResearchCode0
Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-AlignmentCode0
GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis0
Text-to-speech for the hearing impaired0
Vietnamese Text-To-Speech Shared Task VLSP 2020: Remaining problems with state-of-the-art techniques0
Development of Smartcall Vietnamese Text-to-Speech for VLSP 20200
Improving prosodic phrasing of Vietnamese text-to-speech systems0
Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph EntitiesCode1
FBWave: Efficient and Scalable Neural Vocoders for Streaming Text-To-Speech on the Edge0
Bootstrap an end-to-end ASR system by multilingual training, transfer learning, text-to-text mapping and synthetic audio0
Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech0
Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems0
Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet VocoderCode0
Universal MelGAN: A Robust Neural Vocoder for High-Fidelity Waveform Generation in Multiple DomainsCode1
Deep Shallow Fusion for RNN-T Personalization0
Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis0
Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement0
Low-resource expressive text-to-speech using data augmentation0
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS0
Show:102550
← PrevPage 44 of 57Next →

No leaderboard results yet.