SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku").
    # Note: this works only if the installed gTTS version lists "ku"
    # among its supported languages (see gtts.lang.tts_langs()).
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (Kurdish for "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
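The snippet above opens the generated audio file with `start`, which exists only on Windows. A minimal sketch of a cross-platform alternative follows, using only the standard library. The helper names `player_command` and `open_file` are hypothetical, and the fallback branch assumes a Linux-like desktop with `xdg-open` installed.

```python
import platform
import subprocess


def player_command(path, system=None):
    """Return the argv list that opens `path` with the default application."""
    system = system or platform.system()
    if system == "Windows":
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if system == "Darwin":
        # macOS ships the `open` command.
        return ["open", path]
    # Assumption: any other system is a Linux-like desktop with xdg-open.
    return ["xdg-open", path]


def open_file(path):
    """Open `path` with the platform's default handler."""
    subprocess.run(player_command(path), check=False)
```

Passing an argument list to `subprocess.run` instead of building a shell string also avoids quoting problems when the output file name contains spaces.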

Papers

Showing 751-800 of 1419 papers

Title | Status | Hype
LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge | | 0
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar | | 0
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy | | 0
Can we use Common Voice to train a Multi-Speaker TTS system? | Code | 1
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech | | 0
SQuId: Measuring Speech Naturalness in Many Languages | | 0
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era | | 0
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis | | 0
Facial Landmark Predictions with Applications to Metaverse | Code | 0
Multi-Task Adversarial Training Algorithm for Multi-Speaker Neural Text-to-Speech | | 0
Controllable Accented Text-to-Speech Synthesis | | 0
MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline | Code | 1
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models | | 0
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset | | 0
SANIP: Shopping Assistant and Navigation for the visually impaired | | 0
Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech | | 0
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers | Code | 0
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model | | 0
Visualising Model Training via Vowel Space for Text-To-Speech Systems | Code | 1
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale | | 0
Speech Synthesis with Mixed Emotions | | 0
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis | | 0
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation | | 0
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis | | 0
When Is TTS Augmentation Through a Pivot Language Useful? | Code | 0
SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate | | 0
A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System | | 0
Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS | | 0
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech | Code | 3
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition | | 0
End-to-end speech recognition modeling from de-identified data | | 0
LIP: Lightweight Intelligent Preprocessor for meaningful text-to-speech | | 0
Dreamento: an open-source dream engineering toolbox for sleep EEG wearables | Code | 1
BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus | Code | 1
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model | | 0
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS) | | 0
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS) | | 0
DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech | Code | 2
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need | | 0
A Polyphone BERT for Polyphone Disambiguation in Mandarin Chinese | | 0
Fast Bilingual Grapheme-To-Phoneme Conversion | | 0
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems | | 0
Building African Voices | Code | 1
Automatic Evaluation of Speaker Similarity | | 0
R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS | | 0
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder | | 0
Improving Deliberation by Text-Only and Semi-Supervised Training | | 0
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody | | 0
Expressive, Variable, and Controllable Duration Modelling in TTS | | 0
Comparison of Speech Representations for the MOS Prediction System | | 0
Page 16 of 29

No leaderboard results yet.