SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selected for Kurdish).
    # Note: check that "ku" appears in gtts.lang.tts_langs() for your gTTS version.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example:
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")  # "Hello, this is my voice in Kurdish."
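The snippet above opens the saved file with the Windows-only `start` command. A minimal sketch of a portable alternative follows, using only the standard library. The helper names `build_open_command` and `open_audio_file` are hypothetical, and the fallback branch assumes a Linux-like desktop with `xdg-open` installed.

```python
import platform
import subprocess


def build_open_command(path, system=None):
    """Return the argv list that opens `path` with the default application."""
    system = system or platform.system()
    if system == "Windows":
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if system == "Darwin":
        # macOS ships the `open` command.
        return ["open", path]
    # Assumption: any other platform is a Linux-like desktop with xdg-open.
    return ["xdg-open", path]


def open_audio_file(path):
    """Launch the default player using an argv list rather than a shell string."""
    subprocess.run(build_open_command(path), check=False)
```

Passing an argument list instead of an interpolated shell string also avoids problems with file names that contain spaces.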

Papers

Showing 476-500 of 1419 papers

Title | Status | Hype
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control | | 0
Exploring synthetic data for cross-speaker style transfer in style representation based TTS | | 0
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions | | 0
StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis | | 0
Beyond Text-to-Text: An Overview of Multimodal and Generative Artificial Intelligence for Education Using Topic Modeling | | 0
Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech | | 0
On the Feasibility of Fully AI-automated Vishing Attacks | | 0
Zero-shot Cross-lingual Voice Transfer for TTS | | 0
Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space | | 0
Preference Alignment Improves Language Model-Based TTS | | 0
Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference | | 0
DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech | | 0
Exploring an Inter-Pausal Unit (IPU) based Approach for Indic End-to-End TTS Systems | | 0
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild | | 0
The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives | | 0
SpMis: An Investigation of Synthetic Spoken Misinformation Detection | | 0
Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora | | 0
Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization | | 0
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion | | 0
Acquiring Pronunciation Knowledge from Transcribed Speech Audio via Multi-task Learning | | 0
Improving Robustness of Diffusion-Based Zero-Shot Speech Synthesis via Stable Formant Generation | | 0
E1 TTS: Simple and Fast Non-Autoregressive TTS | | 0
Text-To-Speech Synthesis In The Wild | | 0
AccentBox: Towards High-Fidelity Zero-Shot Accent Generation | | 0
HLTCOE JHU Submission to the Voice Privacy Challenge 2024 | | 0
Page 20 of 57

No leaderboard results yet.