SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 801850 of 1419 papers

TitleStatusHype
ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus0
Varianceflow: High-Quality and Controllable Text-to-Speech using Variance Information via Normalizing Flow0
Duration-aware pause insertion using pre-trained language model for multi-speaker text-to-speech0
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition0
Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages0
MAC: A unified framework boosting low resource automatic speech recognition0
UzbekTagger: The rule-based POS tagger for Uzbek language0
Time out of Mind: Generating Rate of Speech conditioned on emotion and speakerCode0
On granularity of prosodic representations in expressive text-to-speech0
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case StudyCode0
Modelling low-resource accents without accent-specific TTS frontend0
UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion0
Applying Automated Machine Translation to Educational Video Courses0
Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition0
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration0
HMM-based data augmentation for E2E systems for building conversational speech synthesis systems0
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement0
Improving the quality of neural TTS using long-form content and multi-speaker multi-style modeling0
TTS-Guided Training for Accent Conversion Without Parallel Data0
Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language0
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder0
Speech Aware Dialog System Technology Challenge (DSTC11)0
Probing Deep Speaker Embeddings for Speaker-related Tasks0
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue0
Low-Resource End-to-end Sanskrit TTS using Tacotron2, WaveGlow and Transfer Learning0
SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech0
Controllable speech synthesis by learning discrete phoneme-level prosodic representations0
Evaluating and reducing the distance between synthetic and real speech distributions0
Contextual Expressive Text-to-Speech0
Efficient Incremental Text-to-Speech on GPUs0
IMaSC -- ICFOSS Malayalam Speech Corpus0
PromptTTS: Controllable Text-to-Speech with Text DescriptionsCode0
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance0
Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models0
Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation0
SNIPER Training: Single-Shot Sparse Training for Text-to-Speech0
Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations0
An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space0
Parallel Attention Forcing for Machine Translation0
Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech0
Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features0
Generating Multilingual Gender-Ambiguous Text-to-Speech Voices0
Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages0
Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers0
Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation0
Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection0
Structured State Space Decoder for Speech Recognition and Synthesis0
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders0
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis0
Residual Adapters for Few-Shot Text-to-Speech Speaker Adaptation0
Show:102550
← PrevPage 17 of 29Next →

No leaderboard results yet.