SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: "ku" may not be among gTTS's supported languages; check gtts.lang.tts_langs().
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (the text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
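The snippet above opens the generated file with the Windows `start` command, so it will not play the audio on other systems. A minimal sketch of a portable alternative follows, using only the standard library. The helper names `build_open_command` and `open_audio_file` are hypothetical and not part of gTTS, and the Linux branch assumes a desktop with `xdg-open` available.

```python
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio_file(path):
    """Open the audio file in the system's default player (hypothetical helper)."""
    subprocess.run(build_open_command(path), check=False)
```

Calling `open_audio_file(output_file)` could then stand in for the `os.system(...)` line. Passing the path as a list element rather than formatting it into a shell string also avoids problems with spaces in file names.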

Papers

Showing 1001-1050 of 1419 papers

Title | Status | Hype
Speech-T: Transducer for Text to Speech and Beyond | | 0
Generating Rich Product Descriptions for Conversational E-commerce Systems | | 0
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance | | 0
Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis | | 0
Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control | | 0
Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages | | 0
High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency | | 0
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech | | 0
Speech Synthesis for Low Resource Languages using Transliteration Enabled Transfer Learning | | 0
Meta-Voice: Fast few-shot style transfer for expressive voice cloning using meta learning | | 0
Speaker Generation | | 0
Emotional Prosody Control for Speech Generation | | 0
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit | | 0
Controlling Prosody in End-to-End TTS: A Case Study on Contrastive Focus Generation | | 0
ViDA-MAN: Visual Dialog with Digital Humans | | 0
DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021 | Code | 0
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech | | 0
ESPnet2-TTS: Extending the Edge of TTS Research | | 0
Neural Dubber: Dubbing for Videos According to Scripts | | 0
From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation | | 0
Exploring Timbre Disentanglement in Non-Autoregressive Cross-Lingual Text-to-Speech | | 0
Revisiting IPA-based Cross-lingual Text-to-speech | | 0
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation | | 0
FedSpeech: Federated Text-to-Speech with Continual Learning | | 0
Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data | | 0
A Melody-Unsupervision Model for Singing Voice Synthesis | | 0
Systematic Inequalities in Language Technology Performance across the World's Languages | Code | 0
Adapting TTS models For New Speakers using Transfer Learning | | 0
Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis | Code | 0
A study on the efficacy of model pre-training in developing neural text-to-speech system | | 0
Environment Aware Text-to-Speech Synthesis | | 0
Applying Phonological Features in Multilingual Text-To-Speech | Code | 0
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over | | 0
Prosody-TTS: An end-to-end speech synthesis system with prosody control | | 0
Hierarchical prosody modeling and control in non-autoregressive parallel neural TTS | | 0
GANtron: Emotional Speech Synthesis with Generative Adversarial Networks | | 0
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models | | 0
Emphasis control for parallel neural TTS | | 0
On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis | | 0
Neural Speech Synthesis in German | | 0
Incorporating speaker embedding and post-filter network for improving speaker similarity of personalized speech synthesis system | | 0
Conditioning Sequence-to-sequence Networks with Learned Activations | | 0
Guided-TTS: Text-to-Speech with Untranscribed Speech | | 0
FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis | | 0
A Proposal of Automatic Error Correction in Text | | 0
Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network | | 0
On-device neural speech synthesis | | 0
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit | Code | 0
Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis | | 0
Integrated Speech and Gesture Synthesis | Code | 0
Page 21 of 29

No leaderboard results yet.