Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 1419 papers

Title	Date	Tasks	Status	Hype	Score
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model	May 11, 2023	DenoisingGPU	CodeCode Available	2	5
PAM: Prompting Audio-Language Models for Audio Quality Assessment	Feb 1, 2024	Audio Quality AssessmentMusic Generation	CodeCode Available	2	5
CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models	Mar 31, 2024	DenoisingSpeech Synthesis	CodeCode Available	2	5
Efficient Neural Audio Synthesis	Feb 23, 2018	Audio SynthesisCPU	CodeCode Available	2	5
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram	Oct 25, 2019	Generative Adversarial NetworkGPU	CodeCode Available	2	5
Nix-TTS: Lightweight and End-to-End Text-to-Speech via Module-wise Distillation	Mar 29, 2022	CPUDecoder	CodeCode Available	2	5
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control	Oct 1, 2024	Emotional Speech SynthesisSpeech Synthesis	CodeCode Available	2	5
PITS: Variational Pitch Inference without Fundamental Frequency for End-to-End Pitch-controllable TTS	Feb 24, 2023	Decodertext-to-speech	CodeCode Available	2	5
CATT: Character-based Arabic Tashkeel Transformer	Jul 3, 2024	Arabic Text DiacritizationDecoder	CodeCode Available	2	5
EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech	Jun 12, 2024	Emotional Speech Synthesistext-to-speech	CodeCode Available	2	5
DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech	Jul 3, 2022	text-to-speechText to Speech	CodeCode Available	2	5
Neural Speech Synthesis with Transformer Network	Sep 19, 2018	DecoderMachine Translation	CodeCode Available	2	5
RWKVTTS: Yet another TTS based on RWKV-7	Apr 4, 2025	Computational Efficiencytext-to-speech	CodeCode Available	2	5
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers	Apr 18, 2023	In-Context LearningSpeech Synthesis	CodeCode Available	2	5
Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness	Apr 10, 2024	Speech Synthesistext-to-speech	CodeCode Available	2	5
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis	Oct 30, 2024	Speech Synthesistext-to-speech	CodeCode Available	2	5
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction	Oct 28, 2018	PredictionSpeech Synthesis	CodeCode Available	2	5
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning	Jun 12, 2024	text-to-speechText to Speech	CodeCode Available	2	5
A Vector Quantized Approach for Text to Speech Synthesis on Real-World Spontaneous Speech	Feb 8, 2023	Code GenerationDiversity	CodeCode Available	2	5
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform	Oct 28, 2022	CPUKnowledge Distillation	CodeCode Available	2	5
IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS	Sep 9, 2024	DenoisingSpeech Enhancement	CodeCode Available	2	5
iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform	Mar 4, 2022	Speech Synthesistext-to-speech	CodeCode Available	2	5
Audio Deepfake Detection with Self-Supervised XLS-R and SLS Classifier	Oct 28, 2024	Audio Deepfake DetectionAudio Generation	CodeCode Available	2	5
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors	Jun 17, 2024	text-to-speechText to Speech	CodeCode Available	2	5
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech	May 15, 2022	Speech SynthesisStyle Transfer	CodeCode Available	2	5

Show:10 25 50

← PrevPage 3 of 57Next →

No leaderboard results yet.