SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 851900 of 1419 papers

TitleStatusHype
Explicit Intensity Control for Accented Text-to-speech0
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech0
Improving Speech-to-Speech Translation Through Unlabeled Text0
Semi-Supervised Learning Based on Reference Model for Low-resource TTS0
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data0
Efficiently Trained Low-Resource Mongolian Text-to-Speech System Based On FullConv-TTS0
Low-Resource Multilingual and Zero-Shot Multispeaker TTS0
Adaptive re-calibration of channel-wise features for Adversarial Audio Classification0
Generating Synthetic Speech from SpokenVocab for Speech TranslationCode0
LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge0
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy0
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar0
SQuId: Measuring Speech Naturalness in Many Languages0
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech0
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era0
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis0
Facial Landmark Predictions with Applications to MetaverseCode0
Multi-Task Adversarial Training Algorithm for Multi-Speaker Neural Text-to-Speech0
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models0
Controllable Accented Text-to-Speech Synthesis0
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset0
SANIP: Shopping Assistant and Navigation for the visually impaired0
Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech0
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State TransducersCode0
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model0
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale0
Speech Synthesis with Mixed Emotions0
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis0
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation0
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis0
When Is TTS Augmentation Through a Pivot Language Useful?Code0
SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate0
A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System0
Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS0
End-to-end speech recognition modeling from de-identified data0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition0
LIP: Lightweight Intelligent Preprocessor for meaningful text-to-speech0
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS)0
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model0
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)0
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need0
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems0
Fast Bilingual Grapheme-To-Phoneme Conversion0
A Polyphone BERT for Polyphone Disambiguation in Mandarin Chinese0
Automatic Evaluation of Speaker Similarity0
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder0
R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS0
Improving Deliberation by Text-Only and Semi-Supervised Training0
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody0
Comparison of Speech Representations for the MOS Prediction System0
Show:102550
← PrevPage 18 of 29Next →

No leaderboard results yet.