SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 226250 of 1419 papers

TitleStatusHype
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech ModelCode1
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence TrainingCode1
AdaSpeech: Adaptive Text to Speech for Custom VoiceCode1
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture SearchCode1
Bidirectional Variational Inference for Non-Autoregressive Text-to-SpeechCode1
Unified Mandarin TTS Front-end Based on Distilled BERT ModelCode1
Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph EntitiesCode1
Universal MelGAN: A Robust Neural Vocoder for High-Fidelity Waveform Generation in Multiple DomainsCode1
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesisCode1
Semi-supervised URL Segmentation with Recurrent Neural NetworksPre-trained on Knowledge Graph EntitiesCode1
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive NormalizationCode1
IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine TranslationCode1
Effective Deep Learning Models for Automatic Diacritization of Arabic TextCode1
One-class learning towards generalized voice spoofing detectionCode1
Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An OverviewCode1
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration ModelingCode1
Accent Estimation of Japanese Words from Their Surfaces and Romanizations for Building Large Vocabulary Accent DictionariesCode1
Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style ConversionCode1
Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length EmbeddingCode1
Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording ConditionsCode1
Pretraining Techniques for Sequence-to-Sequence Voice ConversionCode1
Phonological Features for 0-shot Multilingual Speech SynthesisCode1
One Model, Many Languages: Meta-learning for Multilingual Text-to-SpeechCode1
FastPitch: Parallel Text-to-speech with Pitch PredictionCode1
FastSpeech 2: Fast and High-Quality End-to-End Text to SpeechCode1
Show:102550
← PrevPage 10 of 57Next →

No leaderboard results yet.