SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 626650 of 1419 papers

TitleStatusHype
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance0
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach0
Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data0
Hand Sign to Bangla Speech: A Deep Learning in Vision based system for Recognizing Hand Sign Digits and Generating Bangla Speech0
Harder or Different? Understanding Generalization of Audio Deepfake Detection0
Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM0
Hear Your Code Fail, Voice-Assisted Debugging for Python0
Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech0
Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech0
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech0
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis0
Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control0
Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis0
Hierarchical Representation of Prosody for Statistical Speech Synthesis0
Hierarchical Sequence to Sequence Voice Conversion with Limited Data0
Generative Semantic Communication for Text-to-Speech Synthesis0
Generative Pre-training for Speech with Flow Matching0
HiFiTTS-2: A Large-Scale High Bandwidth Speech Dataset0
Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT0
Audio Deep Fake Detection System with Neural Stitching for ADD 20220
High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models0
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement0
Highly Effective Arabic Diacritization using Sequence to Sequence Modeling0
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units0
Creating New Voices using Normalizing Flows0
Show:102550
← PrevPage 26 of 57Next →

No leaderboard results yet.