SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 876900 of 1419 papers

TitleStatusHype
Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis0
Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners0
Automatic Speech Recognition for Hindi0
AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech0
Autoregressive Diffusion Transformer for Text-to-Speech Synthesis0
Autoregressive Speech Synthesis with Next-Distribution Prediction0
Autoregressive Speech Synthesis without Vector Quantization0
Auto Spell Suggestion for High Quality Speech Synthesis in Hindi0
AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis0
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers0
Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation0
Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS0
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM0
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data0
LAraBench: Benchmarking Arabic AI with Large Language Models0
Benchmarking Expressive Japanese Character Text-to-Speech with VITS and Style-BERT-VITS20
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model0
Beyond Text-to-Text: An Overview of Multimodal and Generative Artificial Intelligence for Education Using Topic Modeling0
BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing0
BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation0
BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization0
Boosting Diffusion Model for Spectrogram Up-sampling in Text-to-speech: An Empirical Study0
Boosting Large Language Model for Speech Synthesis: An Empirical Study0
Bootstrap an end-to-end ASR system by multilingual training, transfer learning, text-to-text mapping and synthetic audio0
Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech0
Show:102550
← PrevPage 36 of 57Next →

No leaderboard results yet.