Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–175 of 1419 papers

Title	Date	Tasks	Status	Hype
Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation	Jul 30, 2023	text-to-speechText to Speech	CodeCode Available	1
ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus	Jul 29, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
SC VALL-E: Style-Controllable Zero-Shot Text to Speech Synthesizer	Jul 20, 2023	Expressive Speech SynthesisLanguage Modelling	CodeCode Available	1
Text + Sketch: Image Compression at Ultra Low Rates	Jul 4, 2023	Image CompressionText to Speech	CodeCode Available	1
EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to Speech	Jun 28, 2023	Emotion RecognitionSpeech Synthesis	CodeCode Available	1
Towards Building Voice-based Conversational Recommender Systems: Datasets, Potential Solutions, and Prospects	Jun 14, 2023	Recommendation Systemstext-to-speech	CodeCode Available	1
ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation	May 29, 2023	Speech Synthesistext-to-speech	CodeCode Available	1
Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS	May 28, 2023	Diversitytext-to-speech	CodeCode Available	1
An Efficient Membership Inference Attack for the Diffusion Model by Proximal Initialization	May 26, 2023	Audio GenerationInference Attack	CodeCode Available	1
Multilingual Text-to-Speech Synthesis for Turkic Languages Using Transliteration	May 25, 2023	Speech Synthesistext-to-speech	CodeCode Available	1
EfficientSpeech: An On-Device Text to Speech Model	May 23, 2023	CPUmodel	CodeCode Available	1
EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels	May 22, 2023	Expressive Speech SynthesisSpeech Synthesis	CodeCode Available	1
Parameter-Efficient Learning for Text-to-Speech Accent Adaptation	May 18, 2023	Decodertext-to-speech	CodeCode Available	1
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation	May 18, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data	May 18, 2023	Speech EnhancementSpeech Synthesis	CodeCode Available	1
Bts-e: Audio deepfake detection using breathing-talking-silence encoder	May 5, 2023	Audio Deepfake DetectionDeepFake Detection	CodeCode Available	1
Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low Resource Languages	Mar 28, 2023	Data Augmentationtext-to-speech	CodeCode Available	1
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations	Mar 3, 2023	Speech DenoisingSpeech Enhancement	CodeCode Available	1
Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding	Mar 2, 2023	Speech Synthesistext-to-speech	CodeCode Available	1
Imaginary Voice: Face-styled Diffusion Model for Text-to-Speech	Feb 27, 2023	Speech Synthesistext-to-speech	CodeCode Available	1
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining	Jan 30, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech	Dec 30, 2022	Denoisingtext-to-speech	CodeCode Available	1
StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS Models	Dec 29, 2022	Data Augmentationtext-to-speech	CodeCode Available	1
RWEN-TTS: Relation-aware Word Encoding Network for Natural Text-to-Speech Synthesis	Dec 15, 2022	RelationSpeech Synthesis	CodeCode Available	1
MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset	Dec 11, 2022	Speech Synthesistext-to-speech	CodeCode Available	1

Show:10 25 50

← PrevPage 7 of 57Next →

No leaderboard results yet.