SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 301325 of 1419 papers

TitleStatusHype
SpikeVoice: High-Quality Text-to-Speech Via Efficient Spiking Neural NetworkCode0
Neural Voice Puppetry: Audio-driven Facial ReenactmentCode0
Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic ForgettingCode0
An investigation of phrase break prediction in an End-to-End TTS systemCode0
Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-SpeechCode0
BanglaFake: Constructing and Evaluating a Specialized Bengali Deepfake Audio DatasetCode0
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-SpeechCode0
A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architectureCode0
Multimodal Latent Language Modeling with Next-Token DiffusionCode0
Naturalization of Text by the Insertion of Pauses and Filler WordsCode0
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State TransducersCode0
MLS: A Large-Scale Multilingual Dataset for Speech ResearchCode0
MelNet: A Generative Model for Audio in the Frequency DomainCode0
Luganda Text-to-Speech MachineCode0
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the BibleCode0
Location-Relative Attention Mechanisms For Robust Long-Form Speech SynthesisCode0
Massively Multilingual Neural Grapheme-to-Phoneme ConversionCode0
LibriS2S: A German-English Speech-to-Speech Translation CorpusCode0
Let's Give a Voice to Conversational Agents in Virtual RealityCode0
Learning High-Frequency Functions Made Easy with Sinusoidal Positional EncodingCode0
Learning Speaker Embedding from Text-to-SpeechCode0
Audio Super Resolution using Neural NetworksCode0
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic ProgrammingCode0
"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen EntitiesCode0
JSSS: free Japanese speech corpus for summarization and simplificationCode0
Show:102550
← PrevPage 13 of 57Next →

No leaderboard results yet.