SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 851875 of 1419 papers

TitleStatusHype
Explicit Intensity Control for Accented Text-to-speech0
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech0
Improving Speech-to-Speech Translation Through Unlabeled Text0
Semi-Supervised Learning Based on Reference Model for Low-resource TTS0
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data0
Efficiently Trained Low-Resource Mongolian Text-to-Speech System Based On FullConv-TTS0
Low-Resource Multilingual and Zero-Shot Multispeaker TTS0
Adaptive re-calibration of channel-wise features for Adversarial Audio Classification0
Generating Synthetic Speech from SpokenVocab for Speech TranslationCode0
LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge0
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy0
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar0
SQuId: Measuring Speech Naturalness in Many Languages0
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech0
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era0
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis0
Facial Landmark Predictions with Applications to MetaverseCode0
Multi-Task Adversarial Training Algorithm for Multi-Speaker Neural Text-to-Speech0
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models0
Controllable Accented Text-to-Speech Synthesis0
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset0
SANIP: Shopping Assistant and Navigation for the visually impaired0
Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech0
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State TransducersCode0
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model0
Show:102550
← PrevPage 35 of 57Next →

No leaderboard results yet.