SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: gTTS raises ValueError for unsupported codes; confirm "ku" is in gtts.lang.tts_langs() first.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
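The snippet above opens the saved file with the `start` command, which exists only on Windows. As a minimal sketch of a more portable alternative, the following uses only the Python standard library. The helper names `build_open_command` and `open_audio_file` are hypothetical and not part of gTTS, and the sketch assumes `open` is available on macOS and `xdg-open` on Linux desktops.

```python
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application.

    `platform` follows the naming of sys.platform ("win32", "darwin", "linux").
    """
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string fills its window-title argument.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a freedesktop-style Linux desktop where xdg-open is installed.
    return ["xdg-open", path]


def open_audio_file(path):
    """Open the audio file with the system default player (best effort)."""
    subprocess.run(build_open_command(path), check=False)
```

Calling `open_audio_file(output_file)` in place of the `os.system(...)` line would keep the rest of the function unchanged. Passing the path as a list element rather than formatting it into a shell string also avoids problems with spaces in file names.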

Papers

Showing 801–825 of 1419 papers

Title | Status | Hype
An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space | | 0
An End-to-End Neural Network for Image-to-Audio Transformation | | 0
A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music | | 0
A New Approach to Voice Authenticity | | 0
An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR | | 0
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis | | 0
An Expert System for Automatic Reading of A Text Written in Standard Arabic | | 0
An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS | | 0
An Implementation of Back-Propagation Learning on GF11, a Large SIMD Parallel Computer | | 0
An In-depth Analysis of the Effect of Text Normalization in Social Media | | 0
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS | | 0
An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis | | 0
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy | | 0
A Novel Approach to OCR using Image Recognition based Classification for Ancient Tamil Inscriptions in Temples | | 0
A Novel Chinese Dialect TTS Frontend with Non-Autoregressive Neural Machine Translation | | 0
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions | | 0
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era | | 0
An overview of text-to-speech systems and media applications | | 0
Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck | | 0
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person | | 0
Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models | | 0
基於字元階層之語音合成用文脈訊息擷取 (Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese] | | 0
基於字元階層之語音合成用文脈訊息擷取(Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese] | | 0
A Polyphone BERT for Polyphone Disambiguation in Mandarin Chinese | | 0
Application of ASV for Voice Identification after VC and Duration Predictor Improvement in TTS Models | | 0

No leaderboard results yet.