SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # gTTS raises ValueError if "ku" is not among its supported languages.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    # Open the audio file (on Windows)
    os.system(f"start {output_file}")

# Example ("Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
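The snippet above opens the saved file with `start`, which only exists on Windows. As a rough sketch of a more portable approach, the standard-library code below picks an opener per platform. The helper names `open_command` and `open_file` are hypothetical (they are not part of gTTS), and the fallback to `xdg-open` assumes a Linux-like desktop where that tool is installed.

```python
import subprocess
import sys


def open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default app."""
    if platform.startswith("win"):
        # `start` is a cmd.exe built-in; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: any other platform is a Linux-like desktop with xdg-open.
    return ["xdg-open", path]


def open_file(path):
    """Open `path` by passing an argument list instead of a shell string."""
    subprocess.run(open_command(path), check=False)
```

Calling `open_file("output.mp3")` after `tts.save(...)` would then replace the `os.system` line. Passing an argument list also avoids problems with file names that contain spaces.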

Papers

Showing 201-225 of 1419 papers

Title | Status | Hype
Benchmarking Large Multimodal Models against Common Corruptions | Code | 1
Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration | Code | 1
FastPitch: Parallel Text-to-speech with Pitch Prediction | Code | 1
Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding | Code | 1
Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text | Code | 1
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech | Code | 1
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet | Code | 1
BiSinger: Bilingual Singing Voice Synthesis | Code | 1
Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding | Code | 1
Attention model for articulatory features detection | Code | 1
ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation | Code | 1
One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech | Code | 1
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech | Code | 1
End-to-End Adversarial Text-to-Speech | Code | 1
End to End Lip Synchronization with a Temporal AutoEncoder | Code | 1
Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech | Code | 1
End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition Model | Code | 1
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels | Code | 1
Brilla AI: AI Contestant for the National Science and Maths Quiz | Code | 1
PRESENT: Zero-Shot Text-to-Prosody Control | Code | 1
Bts-e: Audio deepfake detection using breathing-talking-silence encoder | Code | 1
EfficientSpeech: An On-Device Text to Speech Model | Code | 1
EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to Speech | Code | 1
Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion | Code | 1
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS | Code | 1
Page 9 of 57

No leaderboard results yet.