SOTAVerified

Text to Speech

from gtts import gTTS
import os


def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selected for Kurdish).
    # gTTS raises ValueError if "ku" is not in its supported-language list.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.startfile(output_file)  # Open the audio file (Windows only)


# Example (the string means "Hello, this is my voice in Kurdish."):
# text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
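The snippet above opens the saved audio file with a Windows-only mechanism. A minimal sketch of a portable alternative follows, using only the standard library. The names `build_open_command` and `open_file` are hypothetical helpers, not part of gTTS, and the non-Windows, non-macOS branch assumes a desktop with `xdg-open` available.

```python
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe built-in; the empty string is its window-title argument.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: any other platform is a Linux/BSD desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_file(path):
    """Open `path` in the default player.

    check=True turns a non-zero exit status into CalledProcessError.
    """
    subprocess.run(build_open_command(path), check=True)
```

Calling `open_file("output.mp3")` after `tts.save("output.mp3")` would then replace the Windows-specific call. Passing the command as a list avoids shell quoting problems when the file name contains spaces.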

Papers

Showing 701–725 of 1419 papers

| Title | Status | Hype |
|---|---|---|
| RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis | Code | 1 |
| Probing Deep Speaker Embeddings for Speaker-related Tasks | | 0 |
| BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm | Code | 1 |
| MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset | Code | 1 |
| SpeechLMScore: Evaluating speech generation using speech language model | Code | 1 |
| Learning to Dub Movies via Hierarchical Prosody Models | Code | 1 |
| Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue | | 0 |
| Low-Resource End-to-end Sanskrit TTS using Tacotron2, WaveGlow and Transfer Learning | | 0 |
| SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech | | 0 |
| Controllable speech synthesis by learning discrete phoneme-level prosodic representations | | 0 |
| Evaluating and reducing the distance between synthetic and real speech distributions | | 0 |
| Contextual Expressive Text-to-Speech | | 0 |
| Efficient Incremental Text-to-Speech on GPUs | | 0 |
| IMaSC -- ICFOSS Malayalam Speech Corpus | | 0 |
| PromptTTS: Controllable Text-to-Speech with Text Descriptions | Code | 0 |
| Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models | | 0 |
| Towards Building Text-To-Speech Systems for the Next Billion Users | Code | 2 |
| EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance | | 0 |
| Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation | | 0 |
| SNIPER Training: Single-Shot Sparse Training for Text-to-Speech | | 0 |
| OverFlow: Putting flows on top of neural transducers for better TTS | Code | 1 |
| Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations | | 0 |
| Accented Text-to-Speech Synthesis with a Conditional Variational Autoencoder | Code | 1 |
| ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech | Code | 6 |
| An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space | | 0 |
Page 29 of 57

No leaderboard results yet.