SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 901950 of 1419 papers

TitleStatusHype
Building Synthetic Speaker Profiles in Text-to-Speech Systems0
Multi-Stage Deep Transfer Learning for EmIoT-enabled Human-Computer Interaction0
Transformer-based Models of Text Normalization for Speech Applications0
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs0
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition0
The MSXF TTS System for ICASSP 2022 ADD Challenge0
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention0
Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end0
Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training0
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems0
KazakhTTS2: Extending the Open-Source Kazakh TTS Corpus With More Data, Speakers, and Topics0
A Practical Guide to Logical Access Voice Presentation Attack Detection0
A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architectureCode0
SoK: A Study of the Security on Voice Processing Systems0
Multi-speaker Multi-style Text-to-speech Synthesis With Single-speaker Single-style Training Data Scenarios0
Multi-speaker Emotional Text-to-speech Synthesizer0
Speech-T: Transducer for Text to Speech and Beyond0
Generating Rich Product Descriptions for Conversational E-commerce Systems0
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnetCode1
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance0
Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control0
Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages0
Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis0
More than Words: In-the-Wild Visually-Driven Prosody for Text-to-SpeechCode1
High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency0
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech0
Speech Synthesis for Low Resource Languages using Transliteration Enabled Transfer Learning0
Meta-Voice: Fast few-shot style transfer for expressive voice cloning using meta learning0
Speaker Generation0
Emotional Prosody Control for Speech Generation0
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-SpeechCode1
fairseq Sˆ2: A Scalable and Integrable Speech Synthesis Toolkit0
Controlling Prosody in End-to-End TTS: A Case Study on Contrastive Focus Generation0
ViDA-MAN: Visual Dialog with Digital Humans0
DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021Code0
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech0
FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech DetectionCode1
ESPnet2-TTS: Extending the Edge of TTS Research0
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation0
Neural Dubber: Dubbing for Videos According to Scripts0
Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech0
FedSpeech: Federated Text-to-Speech with Continual Learning0
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation0
Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data0
Revisiting IPA-based Cross-lingual Text-to-speech0
Systematic Inequalities in Language Technology Performance across the World's LanguagesCode0
A Melody-Unsupervision Model for Singing Voice Synthesis0
Fine-grained style control in Transformer-based Text-to-speech SynthesisCode1
Adapting TTS models For New Speakers using Transfer Learning0
Towards Lifelong Learning of Multilingual Text-To-Speech SynthesisCode0
Show:102550
← PrevPage 19 of 29Next →

No leaderboard results yet.