SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 326350 of 1419 papers

TitleStatusHype
Location-Relative Attention Mechanisms For Robust Long-Form Speech SynthesisCode0
LibriS2S: A German-English Speech-to-Speech Translation CorpusCode0
Let's Give a Voice to Conversational Agents in Virtual RealityCode0
Learning Speaker Embedding from Text-to-SpeechCode0
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic ProgrammingCode0
CSS10: A Collection of Single Speaker Speech Datasets for 10 LanguagesCode0
"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen EntitiesCode0
IsoChronoMeter: A simple and effective isochronic translation evaluation metricCode0
JSSS: free Japanese speech corpus for summarization and simplificationCode0
Learning High-Frequency Functions Made Easy with Sinusoidal Positional EncodingCode0
Independent and automatic evaluation of acoustic-to-articulatory inversion modelsCode0
Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-AlignmentCode0
Attentive Multi-Layer Perceptron for Non-autoregressive GenerationCode0
High Fidelity Speech Synthesis with Adversarial NetworksCode0
Attention Forcing for Machine TranslationCode0
Hierarchical Generative Modeling for Controllable Speech SynthesisCode0
Humane Speech Synthesis through Zero-Shot Emotion and Disfluency GenerationCode0
Integrated Speech and Gesture SynthesisCode0
Generating Synthetic Speech from SpokenVocab for Speech TranslationCode0
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogramCode0
Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech RecognitionCode0
Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue ImagingCode0
FPETS : Fully Parallel End-to-End Text-to-Speech SystemCode0
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence LearningCode0
Few-Shot Speech Deepfake Detection Adaptation with Gaussian ProcessesCode0
Show:102550
← PrevPage 14 of 57Next →

No leaderboard results yet.