SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # gTTS validates `lang` by default; if the installed version does not
    # support "ku", the constructor raises ValueError.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f'start "" "{output_file}"')  # Open the audio file (Windows only)

# Example ("Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
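The snippet above opens the saved file with `start`, which exists only in the Windows command shell. A minimal sketch of a platform-aware alternative follows, using only the standard library. The helper names `build_open_command` and `open_audio_file` are hypothetical, and the non-Windows branches assume that `open` (macOS) or `xdg-open` (a Linux desktop with xdg-utils) is available.

```python
import os
import platform
import subprocess


def build_open_command(path, system=None):
    """Return the argv list that opens `path` with the default application."""
    system = system or platform.system()
    if system == "Windows":
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if system == "Darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio_file(path):
    """Open a saved audio file with the platform's default player."""
    if not os.path.exists(path):
        raise FileNotFoundError(path)
    # check=False: a missing player should not crash the caller.
    subprocess.run(build_open_command(path), check=False)
```

Passing the command as a list avoids shell quoting problems when the output path contains spaces. In the function above, the `os.system(...)` line could be swapped for a call to `open_audio_file(output_file)`.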

Papers

Showing 526–550 of 1419 papers

| Title | Status | Hype |
| Deliberation Model for On-Device Spoken Language Understanding | | 0 |
| A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesis | | 0 |
| Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis | | 0 |
| Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese | | 0 |
| FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model | | 0 |
| AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios | | 0 |
| A Unified Model For Voice and Accent Conversion In Speech and Singing using Self-Supervised Learning and Feature Extraction | | 0 |
| UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech | | 0 |
| A unified front-end framework for English text-to-speech synthesis | | 0 |
| Deep Text-to-Speech System with Seq2Seq Model | | 0 |
| An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space | | 0 |
| FlexSpeech: Towards Stable, Controllable and Expressive Text-to-Speech | | 0 |
| Deep Shallow Fusion for RNN-T Personalization | | 0 |
| Deep Performer: Score-to-Audio Music Performance Synthesis | | 0 |
| A Unified Framework for Collecting Text-to-Speech Synthesis Datasets for 22 Indian Languages | | 0 |
| Deep Feed-forward Sequential Memory Networks for Speech Synthesis | | 0 |
| Augmenting text for spoken language understanding with Large Language Models | | 0 |
| An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments | | 0 |
| Deep Denoising Auto-encoder for Statistical Speech Synthesis | | 0 |
| DeepAudio-V1: Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation | | 0 |
| Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework | | 0 |
| Debatts: Zero-Shot Debating Text-to-Speech Synthesis | | 0 |
| D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack | | 0 |
| Augmentation through Laundering Attacks for Audio Spoof Detection | | 0 |
| Data Redaction from Conditional Generative Models | | 0 |

No leaderboard results yet.