SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 701725 of 1419 papers

TitleStatusHype
Large-Scale Automatic Audiobook Creation0
GRASS: Unified Generation Model for Speech-to-Semantic Tasks0
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 20230
PromptTTS 2: Describing and Generating Voices with Text Prompt0
A Comparative Analysis of Pretrained Language Models for Text-to-Speech0
The FruitShell French synthesis system at the Blizzard 2023 Challenge0
Learning Speech Representation From Contrastive Token-Acoustic Pretraining0
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information0
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis0
The DeepZen Speech Synthesis System for Blizzard Challenge 20230
Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech0
Rep2wav: Noise Robust text-to-speech Using self-supervised representations0
Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations0
Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models0
AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis0
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer0
Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head GenerationCode0
Let's Give a Voice to Conversational Agents in Virtual RealityCode0
SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis0
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech0
Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings0
Multilingual context-based pronunciation learning for Text-to-Speech0
METTS: Multilingual Emotional Text-to-Speech by Cross-speaker and Cross-lingual Emotion Transfer0
Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding0
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs0
Show:102550
← PrevPage 29 of 57Next →

No leaderboard results yet.