Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1101–1150 of 1419 papers

Title	Date	Tasks	Status
Word-wise intonation model for cross-language TTS systems	Sep 30, 2024	Dynamic Time WarpingProsody Prediction	—Unverified
You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation	May 14, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Your voice is your voice: Supporting Self-expression through Speech Generation and LLMs in Augmented and Alternative Communication	Mar 21, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Zero-shot Cross-lingual Voice Transfer for TTS	Sep 20, 2024	text-to-speechText to Speech	—Unverified
Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention	Jan 25, 2022	FormSpeech Synthesis	—Unverified
Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling	May 26, 2025	SentenceSpeech Synthesis	—Unverified
Zero-Shot Text-to-Speech as Golden Speech Generator: A Systematic Framework and its Applicability in Automatic Pronunciation Assessment	Sep 11, 2024	text-to-speechText to Speech	—Unverified
Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora	Sep 17, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Zero-Shot Text-to-Speech for Vietnamese	Jun 2, 2025	text-to-speechText to Speech	—Unverified
Zero-shot text-to-speech synthesis conditioned using self-supervised speech representation model	Apr 24, 2023	RhythmSelf-Supervised Learning	—Unverified
Zero-Shot vs. Few-Shot Multi-Speaker TTS Using Pre-trained Czech SpeechT5 Model	Jul 24, 2024	text-to-speechText to Speech	—Unverified
ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models	May 23, 2023	Speech Synthesistext-to-speech	—Unverified
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities	May 29, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech	Aug 28, 2023	Domain Generalizationtext-to-speech	—Unverified
Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis	Apr 14, 2025	Language ModelingLanguage Modelling	—Unverified
Punjabi Text-To-Speech Synthesis System	Dec 1, 2012	Speech Synthesistext-to-speech	—Unverified
運用Python結合語音辨識及合成技術於自動化音文同步之實作(A Python Implementation of Automatic Speech-text Synchronization Using Speech Recognition and Text-to-Speech Technology)[In Chinese]	Oct 1, 2015	speech-recognitionSpeech Recognition	—Unverified
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis	Mar 14, 2023	Emotional Speech SynthesisSentence	—Unverified
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis	Apr 4, 2024	Language ModelingLanguage Modelling	—Unverified
Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning	Dec 2, 2023	Decodertext-to-speech	—Unverified
RASMALAI: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations	May 24, 2025	Expressive Speech SynthesisSpeech Synthesis	—Unverified
RDSinger: Reference-based Diffusion Network for Singing Voice Synthesis	Oct 29, 2024	DenoisingSinging Voice Synthesis	—Unverified
Reading Assistance through LARA, the Learning And Reading Assistant	Jun 1, 2022	text-to-speechText to Speech	—Unverified
Real-Time Pill Identification for the Visually Impaired Using Deep Learning	May 8, 2024	Deep LearningManagement	—Unverified
ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence	May 9, 2022	Speech Synthesistext-to-speech	—Unverified
Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis	Sep 8, 2021	Expressive Speech SynthesisSentence	—Unverified
Refer-iTTS: A System for Referring in Spoken Installments to Objects in Real-World Images	Sep 1, 2017	Referring ExpressionReferring expression generation	—Unverified
Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss	Apr 28, 2022	text-to-speechText to Speech	—Unverified
Reinforce-Aligner: Reinforcement Alignment Search for Robust End-to-End Text-to-Speech	Jun 5, 2021	text-to-speechText to Speech	—Unverified
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability	Apr 3, 2021	Emotion Recognitionreinforcement-learning	—Unverified
DLPO: Diffusion Model Loss-Guided Reinforcement Learning for Fine-Tuning Text-to-Speech Diffusion Models	May 23, 2024	Image Generationreinforcement-learning	—Unverified
Rep2wav: Noise Robust text-to-speech Using self-supervised representations	Aug 28, 2023	Speech Enhancementtext-to-speech	—Unverified
Replacing Human Audio with Synthetic Audio for On-device Unspoken Punctuation Prediction	Oct 20, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification	Apr 6, 2022	AttributeSpeaker Verification	—Unverified
中文轉客文文轉音系統中的客語斷詞處理之研究 (Research on Hakka Word Segmentation Processes in Chinese-to-Hakka Text-to-Speech System )[In Chinese]	Oct 1, 2014	text-to-speechText to Speech	—Unverified
Residual Adapters for Few-Shot Text-to-Speech Speaker Adaptation	Oct 28, 2022	text-to-speechText to Speech	—Unverified
Resource-Efficient Fine-Tuning Strategies for Automatic MOS Prediction in Text-to-Speech for Low-Resource Languages	May 30, 2023	Predictiontext-to-speech	—Unverified
Rethinking MUSHRA: Addressing Modern Challenges in Text-to-Speech Evaluation	Nov 19, 2024	text-to-speechText to Speech	—Unverified
Retrieval-Augmented Audio Deepfake Detection	Apr 22, 2024	Audio Deepfake DetectionDeepFake Detection	—Unverified
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement	Dec 21, 2022	Audio-Visual Speech RecognitionResynthesis	—Unverified
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration	Jan 1, 2023	Audio-Visual Speech RecognitionResynthesis	—Unverified
Revisiting IPA-based Cross-lingual Text-to-speech	Oct 14, 2021	text-to-speechText to Speech	—Unverified
Revisiting Over-Smoothness in Text to Speech	Feb 26, 2022	text-to-speechText to Speech	—Unverified
Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis	May 25, 2025	Speech Synthesistext-to-speech	—Unverified
r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation	Feb 21, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis	Jun 5, 2023	RhythmSentence	—Unverified
R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS	Jun 30, 2022	DecoderGPU	—Unverified
Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization	Jul 2, 2024	Inference OptimizationSpeech Synthesis	—Unverified
RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus	May 1, 2014	Speech Synthesistext-to-speech	—Unverified
RUSLAN: Russian Spoken Language Corpus for Speech Synthesis	Jun 26, 2019	Speech Synthesistext-to-speech	—Unverified

Show:10 25 50

← PrevPage 23 of 29Next →

No leaderboard results yet.