SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 10011025 of 1419 papers

TitleStatusHype
A Survey on Neural Speech SynthesisCode1
FastPitchFormant: Source-filter based Decomposed Modeling for Speech SynthesisCode1
GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis0
Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech0
Non-native English lexicon creation for bilingual speech synthesis0
Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters0
WaveGrad 2: Iterative Refinement for Text-to-Speech SynthesisCode1
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model0
Improving the expressiveness of neural vocoding with non-affine Normalizing Flows0
ADEPT: A Dataset for Evaluating Prosody Transfer0
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis0
RyanSpeech: A Corpus for Conversational Text-to-Speech SynthesisCode1
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform GenerationCode3
A learned conditional prior for the VAE acoustic space of a TTS system0
SynthASR: Unlocking Synthetic Data for Speech Recognition0
HUI-Audio-Corpus-German: A high quality TTS datasetCode1
Enhancing Speaking Styles in Conversational Text-to-Speech Synthesis with Graph-based Multi-modal Context ModelingCode1
Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows0
Speech BERT Embedding For Improving Prosody in Neural TTS0
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios0
Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech GenerationCode1
Reinforce-Aligner: Reinforcement Alignment Search for Robust End-to-End Text-to-Speech0
An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis0
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis0
Dual Script E2E framework for Multilingual and Code-Switching ASR0
Show:102550
← PrevPage 41 of 57Next →

No leaderboard results yet.