SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 201225 of 1419 papers

TitleStatusHype
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-SpeechCode1
FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech DetectionCode1
Fine-grained style control in Transformer-based Text-to-speech SynthesisCode1
Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-SpeechCode1
Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddingsCode1
EdiTTS: Score-based Editing for Controllable Text-to-SpeechCode1
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio NarrationCode1
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021Code1
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice ConversionCode1
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional FusionCode1
FastPitchFormant: Source-filter based Decomposed Modeling for Speech SynthesisCode1
A Survey on Neural Speech SynthesisCode1
WaveGrad 2: Iterative Refinement for Text-to-Speech SynthesisCode1
RyanSpeech: A Corpus for Conversational Text-to-Speech SynthesisCode1
Enhancing Speaking Styles in Conversational Text-to-Speech Synthesis with Graph-based Multi-modal Context ModelingCode1
HUI-Audio-Corpus-German: A high quality TTS datasetCode1
Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech GenerationCode1
Grad-TTS: A Diffusion Probabilistic Model for Text-to-SpeechCode1
Wav2KWS: Transfer Learning from Speech Representations for Keyword SpottingCode1
Deep Learning Based Assessment of Synthetic Speech NaturalnessCode1
AdaSpeech 2: Adaptive Text to Speech with Untranscribed DataCode1
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis DatasetCode1
TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration PredictionCode1
Proteno: Text Normalization with Limited Data for Fast Deployment in Text to Speech SystemsCode1
A Toolbox for Construction and Analysis of Speech DatasetsCode1
Show:102550
← PrevPage 9 of 57Next →

No leaderboard results yet.