Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1351–1375 of 1419 papers

Title	Date	Tasks	Status
RNN Approaches to Text Normalization: A Challenge	Oct 31, 2016	Text Normalizationtext-to-speech	CodeCode Available
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis	Apr 26, 2021	Language ModelingLanguage Modelling	CodeCode Available
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale	Jun 23, 2023	In-Context LearningSpeech Synthesis	CodeCode Available
A Comparative Study on Transformer vs RNN in Speech Applications	Sep 13, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis	Oct 29, 2019	Speaker VerificationSpeech Synthesis	CodeCode Available
Non-Autoregressive Neural Text-to-Speech	May 21, 2019	text-to-speechText to Speech	CodeCode Available
ObamaNet: Photo-realistic lip-sync from text	Dec 6, 2017	Constrained Lip-synchronizationtext-to-speech	CodeCode Available
AlignTTS: Efficient Feed-Forward Text-to-Speech System without Explicit Alignment	Mar 4, 2020	text-to-speechText to Speech	CodeCode Available
Numbers Normalisation in the Inflected Languages: a Case Study of Polish	Aug 1, 2019	text-to-speechText to Speech	CodeCode Available
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks	Apr 1, 2019	Feature Engineeringtext-to-speech	CodeCode Available
BanglaFake: Constructing and Evaluating a Specialized Bengali Deepfake Audio Dataset	May 16, 2025	DeepFake DetectionFace Swapping	CodeCode Available
Neural Voice Puppetry: Audio-driven Facial Reenactment	Dec 11, 2019	Face ModelNeural Rendering	CodeCode Available
Integrated Speech and Gesture Synthesis	Aug 25, 2021	Speech Synthesistext-to-speech	CodeCode Available
Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation	Aug 12, 2023	Talking Head Generationtext-to-speech	CodeCode Available
Independent and automatic evaluation of acoustic-to-articulatory inversion models	Nov 15, 2019	speech-recognitionSpeech Recognition	CodeCode Available
Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks	Sep 23, 2017	Speech Synthesistext-to-speech	CodeCode Available
Naturalization of Text by the Insertion of Pauses and Filler Words	Nov 7, 2020	Sentencetext-to-speech	CodeCode Available
Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation	Mar 31, 2024	Language ModelingLanguage Modelling	CodeCode Available
Deep Voice 2: Multi-Speaker Neural Text-to-Speech	May 24, 2017	Speech Synthesistext-to-speech	CodeCode Available
SaSLaW: Dialogue Speech Corpus with Audio-visual Egocentric Information Toward Environment-adaptive Dialogue Speech Synthesis	Aug 13, 2024	Speech SynthesisSpoken Dialogue Systems	CodeCode Available
Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech	Oct 29, 2024	Decodertext-to-speech	CodeCode Available
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages	Mar 27, 2019	text-to-speechText to Speech	CodeCode Available
AraSpot: Arabic Spoken Command Spotting	Mar 29, 2023	Data AugmentationKeyword Spotting	CodeCode Available
Multi-Source Spatial Knowledge Understanding for Immersive Visual Text-to-Speech	Oct 18, 2024	object-detectionObject Detection	CodeCode Available
Multimodal Latent Language Modeling with Next-Token Diffusion	Dec 11, 2024	Image GenerationLanguage Modeling	CodeCode Available

Show:10 25 50

← PrevPage 55 of 57Next →

No leaderboard results yet.