SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Assumption: the installed gTTS version accepts "ku";
    # gtts.lang.tts_langs() lists the codes it supports.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
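The snippet opens the generated file with `start`, which exists only on Windows. A minimal sketch of a more portable opener follows, using only the standard library. The helper names `opener_command` and `open_audio` are hypothetical and not part of gTTS, and the sketch assumes `open` is available on macOS and `xdg-open` on Linux desktops.

```python
import os
import subprocess
import sys


def opener_command(path, platform=sys.platform):
    """Return the argv list used to open `path`, or None on Windows.

    Hypothetical helper: on Windows no external command is needed,
    because os.startfile uses the file association directly.
    """
    if platform.startswith("win"):
        return None
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio(path):
    """Open an audio file with the system's default player."""
    command = opener_command(path)
    if command is None:
        os.startfile(path)  # Windows only
    else:
        # Passing a list avoids shell quoting problems with odd file names.
        subprocess.run(command, check=False)
```

Calling `open_audio(output_file)` in place of the `os.system` line would keep the Windows behaviour while also covering macOS and Linux, under the assumptions stated above.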

Papers

Showing 551-575 of 1419 papers

Title | Status | Hype
DNN-based Speech Synthesis for Indian Languages from ASCII text |  | 0
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis |  | 0
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data |  | 0
Distribution augmentation for low-resource expressive text-to-speech |  | 0
DisfluencySpeech -- Single-Speaker Conversational Speech Dataset with Paralanguage |  | 0
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS |  | 0
Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters |  | 0
DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction |  | 0
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization |  | 0
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM |  | 0
Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing |  | 0
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech |  | 0
Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS |  | 0
An In-depth Analysis of the Effect of Text Normalization in Social Media |  | 0
DiscreTalk: Text-to-Speech as a Machine Translation Problem |  | 0
Discovering the Italian literature: interactive access to audio indexed text resources |  | 0
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT |  | 0
Direct Text to Speech Translation System using Acoustic Units |  | 0
Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation |  | 0
An Implementation of Back-Propagation Learning on GF11, a Large SIMD Parallel Computer |  | 0
A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data |  | 0
A Challenge Set and Methods for Noun-Verb Ambiguity |  | 0
Voice Impression Control in Zero-Shot TTS |  | 0
On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition |  | 0
Direct Speech to Speech Translation: A Review |  | 0
Page 23 of 57

No leaderboard results yet.