SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 451500 of 1419 papers

TitleStatusHype
AutoTTS: End-to-End Text-to-Speech Synthesis through Differentiable Duration Modeling0
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition0
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model0
EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting0
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems0
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems0
Emphasis control for parallel neural TTS0
Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments0
Emphasized Accent Phrase Prediction from Text for Advertisement Text-To-Speech Synthesis0
Emphasizing Unseen Words: New Vocabulary Acquisition for End-to-End Speech Recognition0
Building Text-to-Speech Systems for Resource Poor Languages0
Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis0
EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech0
Autoregressive Speech Synthesis with Next-Distribution Prediction0
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis0
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator0
End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec20
DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech0
Autoregressive Diffusion Transformer for Text-to-Speech Synthesis0
End-to-end speech recognition modeling from de-identified data0
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue0
End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning0
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE0
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation0
Enhancement of Pitch Controllability using Timbre-Preserving Pitch Augmentation in FastPitch0
Enhancing audio quality for expressive Neural Text-to-Speech0
Enhancing Crowdsourced Audio for Text-to-Speech Models0
Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap0
Diacritization of Maghrebi Arabic Sub-Dialects0
AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech0
Enhancing Speech-to-Speech Translation with Multiple TTS Targets0
An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR0
Expressive Neural Voice Cloning0
Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations0
Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback0
Ensemble prosody prediction for expressive speech synthesis0
Environment Aware Text-to-Speech Synthesis0
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models0
DeviceTTS: A Small-Footprint, Fast, Stable Network for On-Device Text-to-Speech0
Development of Smartcall Vietnamese Text-to-Speech for VLSP 20200
ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs0
ESPnet2-TTS: Extending the Edge of TTS Research0
Automatic Speech Recognition for Hindi0
ESPnet-ST: All-in-One Speech Translation Toolkit0
Character-Level Bangla Text-to-IPA Transcription Using Transformer Architecture with Sequence Alignment0
Evaluating and Improving Automatic Speech Recognition Systems for Korean Meteorological Experts0
Evaluating and Personalizing User-Perceived Quality of Text-to-Speech Voices for Delivering Mindfulness Meditation with Different Physical Embodiments0
Evaluating and reducing the distance between synthetic and real speech distributions0
Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs0
Development of Marathi Part of Speech Tagger Using Statistical Approach0
Show:102550
← PrevPage 10 of 29Next →

No leaderboard results yet.