SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 326350 of 1419 papers

TitleStatusHype
台語古詩朗誦系統A Taiwanese Text-to-Speech System for Ancient Poems[In Chinese]0
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs0
BUCEADOR, a multi-language search engine for digital libraries0
基於字元階層之語音合成用文脈訊息擷取 (Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese]0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text0
Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models0
Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting0
Bridging the Gap: An Intermediate Language for Enhanced and Cost-Effective Grapheme-to-Phoneme Conversion with Homographs with Multiple Pronunciations Disambiguation0
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights0
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person0
Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech20
Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech0
Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck0
Adversarial speech for voice privacy protection from Personalized Speech generation0
A Comparative Analysis of Pretrained Language Models for Text-to-Speech0
Bootstrap an end-to-end ASR system by multilingual training, transfer learning, text-to-text mapping and synthetic audio0
Boosting Large Language Model for Speech Synthesis: An Empirical Study0
An overview of text-to-speech systems and media applications0
Boosting Diffusion Model for Spectrogram Up-sampling in Text-to-speech: An Empirical Study0
BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization0
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era0
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech0
BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation0
BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing0
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions0
Show:102550
← PrevPage 14 of 57Next →

No leaderboard results yet.