Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish ("ku" is chosen as the language code for Kurdish).
    # Note: confirm that "ku" appears in gtts.lang.tts_langs() for your gTTS version.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (the text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
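The snippet above opens the saved file with the Windows `start` command, so it will not play the audio on other systems. Below is a minimal sketch of a portable alternative, assuming `open` is available on macOS and `xdg-open` on Linux desktops. The helper names `opener_command` and `open_audio` are hypothetical and are not part of gTTS.

```python
import os
import subprocess
import sys


def opener_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application."""
    if platform.startswith("win"):
        # The empty string is the window title that `start` expects first.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio(path):
    """Open an existing audio file; raise FileNotFoundError if it is missing."""
    if not os.path.exists(path):
        raise FileNotFoundError(path)
    subprocess.run(opener_command(path), check=False)
```

Calling `open_audio("output.mp3")` after `tts.save("output.mp3")` would then replace the `os.system` line. Passing the path as a list element also avoids shell quoting problems with file names that contain spaces.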

Papers



Title	Date	Tasks	Status
Towards Fully Automatic Annotation of Audio Books for TTS	May 1, 2012	Speech Recognition, Speech Synthesis	Unverified
Towards human-like spoken dialogue generation between AI agents from written dialogue	Oct 2, 2023	Dialogue Generation, text-to-speech	Unverified
Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement	Jan 15, 2025	Computational Efficiency, CPU	Unverified
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale	Aug 21, 2022	Lip Reading	Unverified
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram	Feb 3, 2021	Text to Speech	Unverified
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion	Oct 16, 2020	Speech Synthesis, text-to-speech	Unverified
Towards Optimizing OCR for Accessibility	Jun 21, 2022	Optical Character Recognition (OCR), text-to-speech	Unverified
Towards Robust FastSpeech 2 by Modelling Residual Multimodality	Jun 2, 2023	Decoder, text-to-speech	Unverified
Towards Robust Neural Vocoding for Speech Generation: A Survey	Dec 5, 2019	Speech Synthesis, Survey	Unverified
Prosody Analysis of Audiobooks	Oct 10, 2023	Attribute, Language Modeling	Code Available
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit	Oct 24, 2019	Automatic Speech Recognition (ASR)	Code Available
Systematic Inequalities in Language Technology Performance across the World's Languages	Oct 13, 2021	Dependency Parsing, Machine Translation	Code Available
Systematic Inequalities in Language Technology Performance across the World’s Languages	May 1, 2022	Dependency Parsing, Machine Translation	Code Available
Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding	Jul 12, 2024	regression, text-to-speech	Code Available
FPETS : Fully Parallel End-to-End Text-to-Speech System	Dec 12, 2018	Text to Speech	Code Available
QSpeech: Low-Qubit Quantum Speech Application Toolkit	May 26, 2022	Text to Speech	Code Available
PromptTTS: Controllable Text-to-Speech with Text Descriptions	Nov 22, 2022	Decoder, Speech Synthesis	Code Available
FluentEditor2: Text-based Speech Editing by Modeling Multi-Scale Acoustic and Prosody Consistency	Sep 28, 2024	Text to Speech	Code Available
Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022	May 1, 2022	Decoder, Knowledge Distillation	Code Available
Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet Vocoder	Nov 20, 2020	Model Compression, Quantization	Code Available
Few-Shot Speech Deepfake Detection Adaptation with Gaussian Processes	May 29, 2025	Audio Deepfake Detection, DeepFake Detection	Code Available
Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context Modeling	Oct 12, 2024	Text to Speech	Code Available
Direct speech-to-speech translation with a sequence-to-sequence model	Apr 12, 2019	Speech Synthesis, Speech-to-Speech Translation	Code Available
Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting	Feb 19, 2024	Language Modeling	Code Available
DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021	Oct 25, 2021	Speech Synthesis, text-to-speech	Code Available
Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish	May 31, 2022	Machine Translation, Speech Synthesis	Code Available
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit	Sep 14, 2021	Speech Synthesis, text-to-speech	Code Available
SPEECH-COCO: 600k Visually Grounded Spoken Captions Aligned to MSCOCO Data Set	Jul 26, 2017	Text to Speech	Code Available
SpikeVoice: High-Quality Text-to-Speech Via Efficient Spiking Neural Network	Jul 17, 2024	Text to Speech	Code Available
Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming	Jun 5, 2023	Bayesian Inference, Singing Voice Synthesis	Code Available
Emotional Voice Conversion using Multitask Learning with Text-to-speech	Nov 11, 2019	Decoder, text-to-speech	Code Available
JSSS: free Japanese speech corpus for summarization and simplification	Oct 5, 2020	Form, Speech Synthesis	Code Available
"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities	Dec 26, 2024	Domain Adaptation, Language Modeling	Code Available
Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis	Oct 9, 2021	Lifelong learning, Speech Synthesis	Code Available
AI4D -- African Language Program	Apr 6, 2021	Machine Translation, speech-recognition	Code Available
A Fully Time-domain Neural Model for Subband-based Speech Synthesizer	Oct 12, 2018	Text to Speech	Code Available
Predicting distributions with Linearizing Belief Networks	Nov 17, 2015	Denoising, Facial expression generation	Code Available
Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input	Jul 5, 2021	Speech Synthesis, text-to-speech	Code Available
Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised Learning	Jun 5, 2020	Self-Supervised Learning, Speaker Verification	Code Available
Deep Voice: Real-time Neural Text-to-Speech	Feb 25, 2017	Audio Synthesis, Boundary Detection	Code Available
IsoChronoMeter: A simple and effective isochronic translation evaluation metric	Oct 14, 2024	Machine Translation, text-to-speech	Code Available
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning	Oct 20, 2017	GPU, Speech Synthesis	Code Available
EmoNews: A Spoken Dialogue System for Expressive News Conversations	Jun 16, 2025	Language Modeling	Code Available
When Is TTS Augmentation Through a Pivot Language Useful?	Jul 20, 2022	Automatic Speech Recognition (ASR)	Code Available
Facial Landmark Predictions with Applications to Metaverse	Sep 29, 2022	Decoder, text-to-speech	Code Available
Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging	Jul 12, 2021	Prediction, Speech Synthesis	Code Available
Text-to-ECG: 12-Lead Electrocardiogram Synthesis conditioned on Clinical Text Reports	Mar 9, 2023	Text to Speech	Code Available
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study	Jan 22, 2023	Automatic Speech Recognition (ASR)	Code Available
PolyGlotFake: A Novel Multilingual and Multimodal DeepFake Dataset	May 14, 2024	DeepFake Detection, Face Swapping	Code Available
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language	Oct 29, 2018	Speech Synthesis, text-to-speech	Code Available

