SOTAVerified

Text to Speech

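from gtts import gTTS
import os


def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: gTTS validates language codes, so this call raises ValueError
    # if the installed gTTS version does not list "ku" as supported.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)


# Example ("Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")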
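The snippet above opens the saved file with the Windows `start` command, so the playback step will not work on macOS or Linux. As a rough sketch only, the step could be made portable along the following lines. The helper names `build_open_command` and `open_audio` are hypothetical and not part of the original, and the fallback assumes a desktop with `xdg-open` available.

```python
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: any other platform is a desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio(path):
    """Open the audio file without building a shell command string."""
    subprocess.run(build_open_command(path), check=False)
```

Passing the command as an argument list rather than an f-string also avoids problems with output file names that contain spaces.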

Papers

Showing 426-450 of 1419 papers

| Title | Status | Hype |
| --- | --- | --- |
| Efficient Generative Modeling with Residual Vector Quantization-Based Tokens | | 0 |
| Efficient Incremental Text-to-Speech on GPUs | | 0 |
| DiffVoice: Text-to-Speech with Latent Diffusion | | 0 |
| Efficiently Trained Low-Resource Mongolian Text-to-Speech System Based On FullConv-TTS | | 0 |
| ADEPT: A Dataset for Evaluating Prosody Transfer | | 0 |
| Diff-TTS: A Denoising Diffusion Model for Text-to-Speech | | 0 |
| Efficient training strategies for natural sounding speech synthesis and speaker adaptation based on FastPitch | | 0 |
| ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams | | 0 |
| ELLA-V: Stable Neural Codec Language Modeling with Alignment-guided Sequence Reordering | | 0 |
| AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis | | 0 |
| BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights | | 0 |
| DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles | | 0 |
| EmoCat: Language-agnostic Emotional Voice Conversion | | 0 |
| Auto Spell Suggestion for High Quality Speech Synthesis in Hindi | | 0 |
| Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization | | 0 |
| An Expert System for Automatic Reading of A Text Written in Standard Arabic | | 0 |
| BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text | | 0 |
| A Deep Generative Acoustic Model for Compositional Automatic Speech Recognition | | 0 |
| DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs | | 0 |
| Autoregressive Speech Synthesis without Vector Quantization | | 0 |
| AutoTTS: End-to-End Text-to-Speech Synthesis through Differentiable Duration Modeling | | 0 |
| Autoregressive Speech Synthesis with Next-Distribution Prediction | | 0 |
| Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions | | 0 |
| An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis | | 0 |
| DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech | | 0 |
Page 18 of 57

No leaderboard results yet.