SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 451475 of 1419 papers

TitleStatusHype
Continuous Speech Synthesis using per-token Latent Diffusion0
Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-SpeechCode0
A Unified Framework for Collecting Text-to-Speech Synthesis Datasets for 22 Indian Languages0
Enhancing Crowdsourced Audio for Text-to-Speech Models0
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation0
DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis0
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech0
ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs0
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis0
IsoChronoMeter: A simple and effective isochronic translation evaluation metricCode0
Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context ModelingCode0
Unsupervised Data Validation Methods for Efficient Model Training0
Can DeepFake Speech be Reliably Detected?0
Efficient training strategies for natural sounding speech synthesis and speaker adaptation based on FastPitch0
Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS0
SegINR: Segment-wise Implicit Neural Representation for Sequence Alignment in Neural Text-to-Speech0
HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis0
Adversarial Attacks and Robust Defenses in Speaker Embedding based Zero-Shot Text-to-Speech System0
MultiVerse: Efficient and Expressive Zero-Shot Multi-Task Text-to-Speech0
Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens0
Generative Semantic Communication for Text-to-Speech Synthesis0
Augmentation through Laundering Attacks for Audio Spoof Detection0
Accent conversion using discrete units with parallel data synthesized from controllable accented TTS0
Word-wise intonation model for cross-language TTS systems0
FluentEditor2: Text-based Speech Editing by Modeling Multi-Scale Acoustic and Prosody ConsistencyCode0
Show:102550
← PrevPage 19 of 57Next →

No leaderboard results yet.