SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 226250 of 1419 papers

TitleStatusHype
End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition ModelCode1
End to End Lip Synchronization with a Temporal AutoEncoderCode1
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech ModelCode1
Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style ConversionCode1
Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech UnderstandingCode1
Emotion-Aware Prosodic Phrasing for Expressive Text-to-SpeechCode1
EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to SpeechCode1
End-to-End Adversarial Text-to-SpeechCode1
Evaluating Speech Synthesis by Training Recognizers on Synthetic SpeechCode1
FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech DetectionCode1
Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided AttentionCode1
Enhancing Speaking Styles in Conversational Text-to-Speech Synthesis with Graph-based Multi-modal Context ModelingCode1
EfficientSpeech: An On-Device Text to Speech ModelCode1
A Survey on Neural Speech SynthesisCode1
EdiTTS: Score-based Editing for Controllable Text-to-SpeechCode1
Can we use Common Voice to train a Multi-Speaker TTS system?Code1
TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech SynthesizersCode1
Effective Deep Learning Models for Automatic Diacritization of Arabic TextCode1
Dreamento: an open-source dream engineering toolbox for sleep EEG wearablesCode1
Text + Sketch: Image Compression at Ultra Low RatesCode1
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTSCode1
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found DataCode1
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional FusionCode1
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novelsCode1
Developing multilingual speech synthesis system for Ojibwe, Mi'kmaq, and MaliseetCode1
Show:102550
← PrevPage 10 of 57Next →

No leaderboard results yet.