SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 676700 of 1419 papers

TitleStatusHype
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition0
Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages0
A Vector Quantized Approach for Text to Speech Synthesis on Real-World Spontaneous SpeechCode2
MAC: A unified framework boosting low resource automatic speech recognition0
UzbekTagger: The rule-based POS tagger for Uzbek language0
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text PretrainingCode1
Time out of Mind: Generating Rate of Speech conditioned on emotion and speakerCode0
On granularity of prosodic representations in expressive text-to-speech0
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case StudyCode0
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme PredictionsCode5
Modelling low-resource accents without accent-specific TTS frontend0
UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion0
Applying Automated Machine Translation to Educational Video Courses0
Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition0
Neural Codec Language Models are Zero-Shot Text to Speech SynthesizersCode7
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration0
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to SpeechCode1
StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS ModelsCode1
HMM-based data augmentation for E2E systems for building conversational speech synthesis systems0
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement0
Improving the quality of neural TTS using long-form content and multi-speaker multi-style modeling0
TTS-Guided Training for Accent Conversion Without Parallel Data0
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder0
Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language0
Speech Aware Dialog System Technology Challenge (DSTC11)0
Show:102550
← PrevPage 28 of 57Next →

No leaderboard results yet.