SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 276300 of 1419 papers

TitleStatusHype
AraSpot: Arabic Spoken Command SpottingCode0
A Fully Time-domain Neural Model for Subband-based Speech SynthesizerCode0
QSpeech: Low-Qubit Quantum Speech Application ToolkitCode0
Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022Code0
Preparing an Endangered Language for the Digital Age: The Case of Judeo-SpanishCode0
Predicting distributions with Linearizing Belief NetworksCode0
Applying Phonological Features in Multilingual Text-To-SpeechCode0
PolyGlotFake: A Novel Multilingual and Multimodal DeepFake DatasetCode0
A Comparative Study on Transformer vs RNN in Speech ApplicationsCode0
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesisCode0
PromptTTS: Controllable Text-to-Speech with Text DescriptionsCode0
Non-Autoregressive Neural Text-to-SpeechCode0
ObamaNet: Photo-realistic lip-sync from textCode0
Prosody Analysis of AudiobooksCode0
Neural Voice Puppetry: Audio-driven Facial ReenactmentCode0
Naturalization of Text by the Insertion of Pauses and Filler WordsCode0
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-SpeechCode0
Multimodal Latent Language Modeling with Next-Token DiffusionCode0
MLS: A Large-Scale Multilingual Dataset for Speech ResearchCode0
Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-SpeechCode0
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo ConversionCode0
An Open Source Web Reader for Under-Resourced LanguagesCode0
SpikeVoice: High-Quality Text-to-Speech Via Efficient Spiking Neural NetworkCode0
MelNet: A Generative Model for Audio in the Frequency DomainCode0
Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic ForgettingCode0
Show:102550
← PrevPage 12 of 57Next →

No leaderboard results yet.