Text to Speech

from gtts import gTTS
import os


def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: this assumes gTTS accepts "ku"; gtts.lang.tts_langs() lists the supported codes.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)


# Example ("Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
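The `os.system(f"start {output_file}")` call in the snippet only works on Windows. A minimal sketch of a portable alternative follows, using only the standard library. The helper names `opener_command` and `open_audio` are hypothetical, and the Linux branch assumes `xdg-open` is installed.

```python
import subprocess
import sys


def opener_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string fills its window-title slot.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop with xdg-utils available.
    return ["xdg-open", path]


def open_audio(path):
    """Launch the default player for `path` without building a shell string."""
    subprocess.run(opener_command(path), check=False)
```

Passing an argument list rather than a formatted shell string also avoids problems with output file names that contain spaces.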

Papers

Showing 326–350 of 1419 papers

Title	Date	Tasks	Status	Score
Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis	Oct 23, 2019	Form, Speech Synthesis	Code Available	5
LibriS2S: A German-English Speech-to-Speech Translation Corpus	Apr 22, 2022	Speech-to-Speech Translation, Speech-to-Text	Code Available	5
Let's Give a Voice to Conversational Agents in Virtual Reality	Aug 4, 2023	Speech-to-Text, text-to-speech	Code Available	5
Learning Speaker Embedding from Text-to-Speech	Oct 21, 2020	Classification, Decoder	Code Available	5
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming	Jun 5, 2023	Bayesian Inference, Singing Voice Synthesis	Code Available	5
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages	Mar 27, 2019	text-to-speech, Text to Speech	Code Available	5
"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities	Dec 26, 2024	Domain Adaptation, Language Modeling	Code Available	5
IsoChronoMeter: A simple and effective isochronic translation evaluation metric	Oct 14, 2024	Machine Translation, text-to-speech	Code Available	5
JSSS: free Japanese speech corpus for summarization and simplification	Oct 5, 2020	Form, Speech Synthesis	Code Available	5
Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding	Jul 12, 2024	regression, text-to-speech	Code Available	5
Independent and automatic evaluation of acoustic-to-articulatory inversion models	Nov 15, 2019	speech-recognition, Speech Recognition	Code Available	5
Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment	Dec 4, 2020	Meta-Learning, text-to-speech	Code Available	5
Attentive Multi-Layer Perceptron for Non-autoregressive Generation	Oct 14, 2023	Machine Translation, Speech Synthesis	Code Available	5
High Fidelity Speech Synthesis with Adversarial Networks	Sep 25, 2019	Generative Adversarial Network, Speech Synthesis	Code Available	5
Attention Forcing for Machine Translation	Apr 2, 2021	Machine Translation, NMT	Code Available	5
Hierarchical Generative Modeling for Controllable Speech Synthesis	Oct 16, 2018	Attribute, Speech Synthesis	Code Available	5
Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation	Mar 31, 2024	Language Modeling, Language Modelling	Code Available	5
Integrated Speech and Gesture Synthesis	Aug 25, 2021	Speech Synthesis, text-to-speech	Code Available	5
Generating Synthetic Speech from SpokenVocab for Speech Translation	Oct 15, 2022	Data Augmentation, Machine Translation	Code Available	5
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogram	Apr 8, 2019	Speech Synthesis, text-to-speech	Code Available	5
Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition	Aug 17, 2024	Language Modeling, Language Modelling	Code Available	5
Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue Imaging	Jul 26, 2021	text-to-speech, Text to Speech	Code Available	5
FPETS: Fully Parallel End-to-End Text-to-Speech System	Dec 12, 2018	text-to-speech, Text to Speech	Code Available	5
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning	Oct 20, 2017	GPU, Speech Synthesis	Code Available	5
Few-Shot Speech Deepfake Detection Adaptation with Gaussian Processes	May 29, 2025	Audio Deepfake Detection, DeepFake Detection	Code Available	5
