SOTAVerified

Text to Speech

from gtts import gTTS
import os


def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Check gtts.lang.tts_langs() to confirm the installed gTTS version accepts "ku".
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)


# Example:
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")  # "Hello, this is my voice in Kurdish."
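The snippet above opens the generated file with the Windows-only `start` command. A minimal sketch of a portable alternative, using only the standard library, might look like the following. The helper names `player_command` and `open_audio` are hypothetical, and the choice of `open` on macOS and `xdg-open` on Linux assumes those tools are installed.

```python
import os
import platform
import subprocess


def player_command(path, system=None):
    """Return the argv list that opens `path` with the default app on non-Windows systems."""
    system = system or platform.system()
    if system == "Darwin":
        return ["open", path]      # macOS
    return ["xdg-open", path]      # assumed available on most Linux desktops


def open_audio(path, system=None):
    """Open `path` in the default player for the current (or given) platform."""
    system = system or platform.system()
    if system == "Windows":
        os.startfile(path)         # Windows-only standard-library call
    else:
        # Passing a list avoids the shell, so unusual file names are handled safely.
        subprocess.run(player_command(path, system), check=False)
```

Calling `open_audio("output.mp3")` after `tts.save("output.mp3")` would then replace the `os.system(...)` line.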

Papers

Showing 751–775 of 1419 papers

Title | Status | Hype
Rep2wav: Noise Robust text-to-speech Using self-supervised representations |  | 0
Replacing Human Audio with Synthetic Audio for On-device Unspoken Punctuation Prediction |  | 0
Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification |  | 0
中文轉客文文轉音系統中的客語斷詞處理之研究 (Research on Hakka Word Segmentation Processes in Chinese-to-Hakka Text-to-Speech System) [In Chinese] |  | 0
Residual Adapters for Few-Shot Text-to-Speech Speaker Adaptation |  | 0
Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction in Text-to-Speech for Low-Resource Languages |  | 0
Rethinking MUSHRA: Addressing Modern Challenges in Text-to-Speech Evaluation |  | 0
Retrieval-Augmented Audio Deepfake Detection |  | 0
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement |  | 0
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration |  | 0
Revisiting IPA-based Cross-lingual Text-to-speech |  | 0
Revisiting Over-Smoothness in Text to Speech |  | 0
Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis |  | 0
r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation |  | 0
Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis |  | 0
R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS |  | 0
Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization |  | 0
RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus |  | 0
RUSLAN: Russian Spoken Language Corpus for Speech Synthesis |  | 0
RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform |  | 0
S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation |  | 0
Sadeed: Advancing Arabic Diacritization Through Small Language Model |  | 0
SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction |  | 0
SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation |  | 0
SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis |  | 0
Page 31 of 57

No leaderboard results yet.