SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 676700 of 1419 papers

TitleStatusHype
Improving Deliberation by Text-Only and Semi-Supervised Training0
HMM-based data augmentation for E2E systems for building conversational speech synthesis systems0
Improving Grapheme-to-Phoneme Conversion through In-Context Knowledge Retrieval with Large Language Models0
CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR0
Improving Low Resource Code-switched ASR using Augmented Code-switched TTS0
Improving LPCNet-based Text-to-Speech with Linear Prediction-structured Mixture Density Network0
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information0
Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows0
Improving Noise Robustness of LLM-based Zero-shot TTS via Discrete Acoustic Token Denoising0
Improving Performance of End-to-End ASR on Numeric Sequences0
Improving prosodic phrasing of Vietnamese text-to-speech systems0
Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis0
Improving Readability for Automatic Speech Recognition Transcription0
HLTCOE JHU Submission to the Voice Privacy Challenge 20240
Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment0
Improving Speech-to-Speech Translation Through Unlabeled Text0
Improving the expressiveness of neural vocoding with non-affine Normalizing Flows0
Improving the quality of neural TTS using long-form content and multi-speaker multi-style modeling0
Cued Speech Generation Leveraging a Pre-trained Audiovisual Text-to-Speech Model0
Incorporating speaker embedding and post-filter network for improving speaker similarity of personalized speech synthesis system0
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis0
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI0
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time0
High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency0
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units0
Show:102550
← PrevPage 28 of 57Next →

No leaderboard results yet.