SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 851900 of 1419 papers

TitleStatusHype
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions0
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent0
Study of Indian English Pronunciation Variabilities relative to Received Pronunciation0
Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech0
Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN0
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models0
StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis0
Style Mixture of Experts for Expressive Text-To-Speech Synthesis0
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech0
Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation0
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion0
Style Variation as a Vantage Point for Code-Switching0
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System0
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition0
SyncSpeech: Low-Latency and Efficient Dual-Stream Text-to-Speech based on Temporal Masked Transformer0
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal0
Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech0
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments0
SynthASR: Unlocking Synthetic Data for Speech Recognition0
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition0
Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations0
Synthetic Speaking Children -- Why We Need Them and How to Make Them0
Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features0
Talking Face Generation with Multilingual TTS0
Talrómur: A large Icelandic TTS corpus0
Statistical Context-Dependent Units Boundary Correction for Corpus-based Unit-Selection Text-to-Speech0
Teacher-Student Training for Robust Tacotron-based TTS0
Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages0
Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale0
Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise0
Teochew-Wild: The First In-the-wild Teochew Dataset with Orthographic Annotations0
Text-aware and Context-aware Expressive Audiobook Speech Synthesis0
Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS0
Text-free non-parallel many-to-many voice conversion using normalising flows0
Text Generation with Speech Synthesis for ASR Data Augmentation0
Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis0
Text Is Not All You Need: Multimodal Prompting Helps LLMs Understand Humor0
Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens0
Text-To-Speech Data Augmentation for Low Resource Speech Recognition0
Text-To-Speech for Languages without an Orthography0
Text-to-Speech for Under-Resourced Languages: Phoneme Mapping and Source Language Selection in Transfer Learning0
Text-to-Speech Pipeline for Swiss German -- A comparison0
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder0
Text-To-Speech Synthesis In The Wild0
Textual Echo Cancellation0
The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives0
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese0
The DeepZen Speech Synthesis System for Blizzard Challenge 20230
The Effects of Input Type and Pronunciation Dictionary Usage in Transfer Learning for Low-Resource Text-to-Speech0
The FruitShell French synthesis system at the Blizzard 2023 Challenge0
Show:102550
← PrevPage 18 of 29Next →

No leaderboard results yet.