SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (the language code "ku" selects Kurdish).
    # Check gtts.lang.tts_langs() to confirm "ku" is available in your installed version.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (the string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
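The snippet above opens the saved audio with `start`, which only exists on Windows. A minimal sketch of a platform-aware alternative follows, using only the standard library. The helper names `open_command` and `open_file` are hypothetical and not part of gTTS, and the fallback branch assumes a desktop with `xdg-open` installed.

```python
import platform
import subprocess


def open_command(path, system=None):
    """Return the argv list that opens `path` with the default application."""
    system = system or platform.system()
    if system == "Windows":
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if system == "Darwin":
        return ["open", path]
    # Assumption: any other system is a Linux/BSD desktop with xdg-utils.
    return ["xdg-open", path]


def open_file(path):
    """Open `path` without building a shell command string."""
    subprocess.run(open_command(path), check=False)
```

Passing the path as a separate list element instead of interpolating it into a shell string means file names with spaces need no manual quoting.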

Papers

Showing 626-650 of 1419 papers

Title | Status | Hype
Deep Shallow Fusion for RNN-T Personalization | | 0
Deep Performer: Score-to-Audio Music Performance Synthesis | | 0
A Unified Framework for Collecting Text-to-Speech Synthesis Datasets for 22 Indian Languages | | 0
Deep Feed-forward Sequential Memory Networks for Speech Synthesis | | 0
Augmenting text for spoken language understanding with Large Language Models | | 0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments | | 0
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios | | 0
Deep Denoising Auto-encoder for Statistical Speech Synthesis | | 0
DeepAudio-V1: Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation | | 0
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework | | 0
Debatts: Zero-Shot Debating Text-to-Speech Synthesis | | 0
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack | | 0
Augmentation through Laundering Attacks for Audio Spoof Detection | | 0
Data Redaction from Conditional Generative Models | | 0
Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System | | 0
Data Efficient Voice Cloning for Neural Singing Synthesis | | 0
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech | | 0
AudioVisual Speech Synthesis: A brief literature review | | 0
AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style | | 0
Accented Text-to-Speech Synthesis with Limited Data | | 0
Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys | | 0
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios | | 0
DASB -- Discrete Audio and Speech Benchmark | | 0
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech | | 0
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue | | 0

No leaderboard results yet.