SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 751775 of 1419 papers

TitleStatusHype
LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge0
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar0
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy0
Can we use Common Voice to train a Multi-Speaker TTS system?Code1
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech0
SQuId: Measuring Speech Naturalness in Many Languages0
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era0
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis0
Facial Landmark Predictions with Applications to MetaverseCode0
Multi-Task Adversarial Training Algorithm for Multi-Speaker Neural Text-to-Speech0
Controllable Accented Text-to-Speech Synthesis0
MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied BaselineCode1
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models0
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset0
SANIP: Shopping Assistant and Navigation for the visually impaired0
Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech0
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State TransducersCode0
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model0
Visualising Model Training via Vowel Space for Text-To-Speech SystemsCode1
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale0
Speech Synthesis with Mixed Emotions0
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis0
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation0
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis0
When Is TTS Augmentation Through a Pivot Language Useful?Code0
Show:102550
← PrevPage 31 of 57Next →

No leaderboard results yet.