SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 951975 of 1419 papers

TitleStatusHype
Une aide \`a la communication par pictogrammes avec pr\'ediction s\'emantique0
UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation0
Unified speech and gesture synthesis using flow matching0
UniFLG: Unified Facial Landmark Generator from Text or Speech0
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)0
UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion0
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation0
Unsupervised Data Validation Methods for Efficient Model Training0
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages0
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis0
Unsupervised Polyglot Text To Speech0
Unsupervised pre-training for sequence to sequence speech recognition0
Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis0
Unsupervised word-level prosody tagging for controllable speech synthesis0
Controllable Speaking Styles Using a Large Language Model0
Using Audio Books for Training a Text-to-Speech System0
Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition0
Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement0
Using previous acoustic context to improve Text-to-Speech synthesis0
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset0
Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems0
Using the LARA Little Prince to compare human and TTS audio quality0
Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech0
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction0
Utilizing Speech Emotion Recognition and Recommender Systems for Negative Emotion Handling in Therapy Chatbots0
Show:102550
← PrevPage 39 of 57Next →

No leaderboard results yet.