SOTAVerified

Text to Speech

from gtts import gTTS
import os


def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: this only works if "ku" is among the languages gTTS supports;
    # gtts.lang.tts_langs() lists them.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)


# Example ("Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
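The snippet above opens the saved file with the Windows `start` command, so it will not play the audio on other systems. A minimal sketch of a cross-platform alternative follows, using only the standard library. The helper names `build_open_command` and `open_audio` are hypothetical and not part of the original snippet, and the sketch assumes `open` is available on macOS and `xdg-open` on Linux desktops.

```python
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe built-in; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: any other platform is a Linux-like desktop with xdg-open.
    return ["xdg-open", path]


def open_audio(path):
    """Open the audio file with the platform's default player."""
    # check=False: a missing opener should not crash the caller.
    subprocess.run(build_open_command(path), check=False)
```

Passing the command as a list rather than an f-string avoids quoting problems when the output path contains spaces.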

Papers

Showing 976–1000 of 1419 papers

| Title | Status | Hype |
|---|---|---|
| Improving Cross-lingual Speech Synthesis with Triplet Training Scheme | | 0 |
| r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation | | 0 |
| ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech | | 0 |
| Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module | | 0 |
| Unsupervised word-level prosody tagging for controllable speech synthesis | | 0 |
| NewsPod: Automatic and Interactive News Podcasts | | 0 |
| Distribution augmentation for low-resource expressive text-to-speech | | 0 |
| Deep Performer: Score-to-Audio Music Performance Synthesis | | 0 |
| Cross-speaker style transfer for text-to-speech using data augmentation | | 0 |
| Building Synthetic Speaker Profiles in Text-to-Speech Systems | | 0 |
| Multi-Stage Deep Transfer Learning for EmIoT-enabled Human-Computer Interaction | | 0 |
| Transformer-based Models of Text Normalization for Speech Applications | | 0 |
| DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs | | 0 |
| Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition | | 0 |
| The MSXF TTS System for ICASSP 2022 ADD Challenge | | 0 |
| Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention | | 0 |
| Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end | | 0 |
| Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training | | 0 |
| Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems | | 0 |
| KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics | | 0 |
| A Practical Guide to Logical Access Voice Presentation Attack Detection | | 0 |
| A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture | Code | 0 |
| SoK: A Study of the Security on Voice Processing Systems | | 0 |
| Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios | | 0 |
| Multi-speaker Emotional Text-to-speech Synthesizer | | 0 |
Page 40 of 57

No leaderboard results yet.