SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 401425 of 1419 papers

TitleStatusHype
DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech0
DTW-SiameseNet: Dynamic Time Warped Siamese Network for Mispronunciation Detection and Correction0
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model0
Dual Script E2E framework for Multilingual and Code-Switching ASR0
DualSpeech: Enhancing Speaker-Fidelity and Text-Intelligibility Through Dual Classifier-Free Guidance0
Dual Supervised Learning0
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing0
Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech0
Emphasis control for parallel neural TTS0
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers0
DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis0
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection0
Direct Speech to Speech Translation: A Review0
An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS0
Digital Einstein Experience: Fast Text-to-Speech for Conversational AI0
DiffVoice: Text-to-Speech with Latent Diffusion0
ADEPT: A Dataset for Evaluating Prosody Transfer0
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech0
AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis0
Effective Decoder Masking for Transformer Based End-to-End Speech Recognition0
DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles0
Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition0
BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization0
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment0
Auto Spell Suggestion for High Quality Speech Synthesis in Hindi0
Show:102550
← PrevPage 17 of 57Next →

No leaderboard results yet.