SOTAVerified

Text to Speech

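from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish ("ku" is the language code chosen for Kurdish).
    # gTTS validates the language code by default and raises ValueError
    # if "ku" is not in its list of supported languages.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    # Open the audio file (Windows only: "start" is a Windows shell command).
    os.system(f"start {output_file}")

# Example (the Kurdish text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")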

Papers

Showing 751–800 of 1419 papers

Title | Status | Hype
A Comparative Analysis of Pretrained Language Models for Text-to-Speech | | 0
A Context-Based Numerical Format Prediction for a Text-To-Speech System | | 0
A Corpus of Neutral Voice Speech in Brazilian Portuguese | | 0
A Cost Efficient Approach to Correct OCR Errors in Large Document Collections | | 0
Acquiring Pronunciation Knowledge from Transcribed Speech Audio via Multi-task Learning | | 0
A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System | | 0
AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN | | 0
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data | | 0
Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers | | 0
Adapting TTS models For New Speakers using Transfer Learning | | 0
Adaptive re-calibration of channel-wise features for Adversarial Audio Classification | | 0
AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style | | 0
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios | | 0
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis | | 0
A Deep Generative Acoustic Model for Compositional Automatic Speech Recognition | | 0
ADEPT: A Dataset for Evaluating Prosody Transfer | | 0
A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data | | 0
Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters | | 0
Empowering Communication: Speech Technology for Indian and Western Accents through AI-powered Speech Synthesis | | 0
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset | | 0
Adversarial Attacks and Robust Defenses in Speaker Embedding based Zero-Shot Text-to-Speech System | | 0
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech | | 0
Adversarial speech for voice privacy protection from Personalized Speech generation | | 0
Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting | | 0
台語古詩朗誦系統 (A Taiwanese Text-to-Speech System for Ancient Poems) [In Chinese] | | 0
AE-Flow: AutoEncoder Normalizing Flow | | 0
AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis | | 0
A Framework for Synthetic Audio Conversations Generation using Large Language Models | | 0
A Fully Time-domain Neural Model for Subband-based Speech Synthesizer | | 0
A Generative Model of a Pronunciation Lexicon for Hindi | | 0
A Human-in-the-Loop Approach to Improving Cross-Text Prosody Transfer | | 0
Ain't Misbehavin' -- Using LLMs to Generate Expressive Robot Behavior in Conversations with the Tabletop Robot Haru | | 0
AI-Powered Assistive Technologies for Visual Impairment | | 0
A Language Modeling Approach to Diacritic-Free Hebrew TTS | | 0
A Large-Scale User Study of an Alexa Prize Chatbot: Effect of TTS Dynamism on Perceived Quality of Social Dialog | | 0
A learned conditional prior for the VAE acoustic space of a TTS system | | 0
Aligner-Guided Training Paradigm: Advancing Text-to-Speech Models with Aligner Guided Duration | | 0
Almost Unsupervised Text to Speech and Automatic Speech Recognition | | 0
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input | | 0
A Mask-based Model for Mandarin Chinese Polyphone Disambiguation | | 0
A Melody-Unsupervision Model for Singing Voice Synthesis | | 0
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach | | 0
A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models | | 0
A multilingual training strategy for low resource Text to Speech | | 0
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge | | 0
AMuSeD: An Attentive Deep Neural Network for Multimodal Sarcasm Detection Incorporating Bi-modal Data Augmentation | | 0
An adaptable task-oriented dialog system for stand-alone embedded devices | | 0
An Algorithm Based on Empirical Methods, for Text-to-Tuneful-Speech Synthesis of Sanskrit Verse | | 0
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue | | 0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments | | 0

No leaderboard results yet.