SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 776800 of 1419 papers

TitleStatusHype
AE-Flow: AutoEncoder Normalizing Flow0
AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis0
A Framework for Synthetic Audio Conversations Generation using Large Language Models0
A Fully Time-domain Neural Model for Subband-based Speech Synthesizer0
A Generative Model of a Pronunciation Lexicon for Hindi0
A Human-in-the-Loop Approach to Improving Cross-Text Prosody Transfer0
Ain't Misbehavin' -- Using LLMs to Generate Expressive Robot Behavior in Conversations with the Tabletop Robot Haru0
AI-Powered Assistive Technologies for Visual Impairment0
A Language Modeling Approach to Diacritic-Free Hebrew TTS0
A Large-Scale User Study of an Alexa Prize Chatbot: Effect of TTS Dynamism on Perceived Quality of Social Dialog0
A learned conditional prior for the VAE acoustic space of a TTS system0
Aligner-Guided Training Paradigm: Advancing Text-to-Speech Models with Aligner Guided Duration0
Almost Unsupervised Text to Speech and Automatic Speech Recognition0
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input0
A Mask-based Model for Mandarin Chinese Polyphone Disambiguation0
A Melody-Unsupervision Model for Singing Voice Synthesis0
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach0
A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models0
A multilingual training strategy for low resource Text to Speech0
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge0
AMuSeD: An Attentive Deep Neural Network for Multimodal Sarcasm Detection Incorporating Bi-modal Data Augmentation0
An adaptable task-oriented dialog system for stand-alone embedded devices0
An Algorithm Based on Empirical Methods, for Text-to-Tuneful-Speech Synthesis of Sanskrit Verse0
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
Show:102550
← PrevPage 32 of 57Next →

No leaderboard results yet.