SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 426450 of 1419 papers

TitleStatusHype
Application of ASV for Voice Identification after VC and Duration Predictor Improvement in TTS Models0
AE-Flow: AutoEncoder Normalizing Flow0
Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis0
Building a mixed-lingual neural TTS system with only monolingual data0
A Polyphone BERT for Polyphone Disambiguation in Mandarin Chinese0
Emotional Prosody Control for Speech Generation0
EmoTalkingGaussian: Continuous Emotion-conditioned Talking Head Synthesis0
Building a Luganda Text-to-Speech Model From Crowdsourced Data0
基於字元階層之語音合成用文脈訊息擷取(Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese]0
台語古詩朗誦系統A Taiwanese Text-to-Speech System for Ancient Poems[In Chinese]0
A Context-Based Numerical Format Prediction for a Text-To-Speech System0
AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge0
BUCEADOR, a multi-language search engine for digital libraries0
EmoSpeech: A Corpus of Emotionally Rich and Contextually Detailed Speech Annotations0
基於字元階層之語音合成用文脈訊息擷取 (Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese]0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text0
Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization0
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance0
Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models0
Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting0
EmoCat: Language-agnostic Emotional Voice Conversion0
Bridging the Gap: An Intermediate Language for Enhanced and Cost-Effective Grapheme-to-Phoneme Conversion with Homographs with Multiple Pronunciations Disambiguation0
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions0
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights0
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person0
Show:102550
← PrevPage 18 of 57Next →

No leaderboard results yet.