SOTAVerified

Text to Speech

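from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # gTTS raises ValueError for unsupported codes; check that "ku" is listed
    # in gtts.lang.tts_langs() for the installed version.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (the Kurdish text reads "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")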
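The snippet above opens the saved file with `start`, which exists only on Windows. A minimal sketch of a cross-platform alternative follows, using only the standard library. The helper names `build_open_command` and `open_audio_file` are hypothetical and not part of gTTS, and the fallback assumes a Linux-like desktop with `xdg-open` installed.

```python
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string fills its window-title slot.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        # macOS ships the `open` command.
        return ["open", path]
    # Assumption: any other platform is a Linux-like desktop with xdg-open installed.
    return ["xdg-open", path]


def open_audio_file(path):
    """Launch the default player for `path` without building a shell string."""
    subprocess.run(build_open_command(path), check=False)
```

Passing the path as a separate list element, rather than interpolating it into a shell string, also keeps file names with spaces intact.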

Papers

Showing 551–575 of 1419 papers

| Title | Status | Hype |
| --- | --- | --- |
| AudioVisual Speech Synthesis: A brief literature review | | 0 |
| Data Efficient Voice Cloning for Neural Singing Synthesis | | 0 |
| Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech | | 0 |
| AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style | | 0 |
| Accented Text-to-Speech Synthesis with Limited Data | | 0 |
| Generating Multilingual Voices Using Speaker Space Translation Based on Bilingual Speaker Data | | 0 |
| Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys | | 0 |
| Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios | | 0 |
| DASB -- Discrete Audio and Speech Benchmark | | 0 |
| DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech | | 0 |
| Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue | | 0 |
| Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition | | 0 |
| Cycle-consistency training for end-to-end speech recognition | | 0 |
| Customizing Grapheme-to-Phoneme System for Non-Trivial Transcription Problems in Bangla Language | | 0 |
| AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models | | 0 |
| An Algorithm Based on Empirical Methods, for Text-to-Tuneful-Speech Synthesis of Sanskrit Verse | | 0 |
| CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR | | 0 |
| Cued Speech Generation Leveraging a Pre-trained Audiovisual Text-to-Speech Model | | 0 |
| A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI | | 0 |
| Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis | | 0 |
| CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder | | 0 |
| An adaptable task-oriented dialog system for stand-alone embedded devices | | 0 |
| Audio Deep Fake Detection System with Neural Stitching for ADD 2022 | | 0 |
| Crowdsourcing Latin American Spanish for Low-Resource Text-to-Speech | | 0 |
| AMuSeD: An Attentive Deep Neural Network for Multimodal Sarcasm Detection Incorporating Bi-modal Data Augmentation | | 0 |
Page 23 of 57

No leaderboard results yet.