Text to Speech

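from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Assumption to verify: "ku" must be among the codes returned by
    # gtts.lang.tts_langs(); gTTS raises ValueError for unsupported codes.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")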
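The snippet above opens the saved audio with `start`, which exists only on Windows. A minimal sketch of a portable alternative follows, using only the standard library. The helper names `open_command` and `play` are hypothetical, and the Linux branch assumes a desktop where `xdg-open` is installed.

```python
import subprocess
import sys


def open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string is its window-title argument.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        # macOS ships the `open` utility.
        return ["open", path]
    # Assumption: a freedesktop-style Linux desktop with xdg-open available.
    return ["xdg-open", path]


def play(path):
    """Open the audio file in the default player; returns the process exit code."""
    return subprocess.run(open_command(path), check=False).returncode
```

With this in place, the `os.system(...)` line could be swapped for `play(output_file)`. Passing an argument list instead of a shell string also avoids quoting problems when the file name contains spaces.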

Papers

Showing 851–900 of 1419 papers

Title	Date	Tasks	Status
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions	May 30, 2023	All, Automatic Speech Recognition	Unverified
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent	Mar 28, 2022	text-to-speech, Text to Speech	Unverified
Study of Indian English Pronunciation Variabilities relative to Received Pronunciation	Apr 13, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech	Nov 4, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN	Oct 27, 2023	Decoder, Denoising	Unverified
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models	Oct 6, 2021	text-to-speech, Text to Speech	Unverified
StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis	Sep 24, 2024	Speech Synthesis, text-to-speech	Unverified
Style Mixture of Experts for Expressive Text-To-Speech Synthesis	Jun 5, 2024	Mixture-of-Experts, Speech Synthesis	Unverified
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech	Mar 17, 2021	Speech Synthesis, Style Transfer	Unverified
Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation	Aug 13, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion	Sep 16, 2024	Speech Synthesis, text-to-speech	Unverified
Style Variation as a Vantage Point for Code-Switching	May 1, 2020	Language Modeling, Language Modelling	Unverified
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System	Mar 29, 2025	Speech Synthesis, text-to-speech	Unverified
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition	Jun 5, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
SyncSpeech: Low-Latency and Efficient Dual-Stream Text-to-Speech based on Temporal Masked Transformer	Feb 16, 2025	text-to-speech, Text to Speech	Unverified
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal	Dec 13, 2020	Diversity, Representation Learning	Unverified
Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech	Nov 24, 2020	Data Augmentation, Speaker Recognition	Unverified
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments	Jul 23, 2024	Diversity, Keyword Spotting	Unverified
SynthASR: Unlocking Synthetic Data for Speech Recognition	Jun 14, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition	Jan 27, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations	Jun 25, 2022	text-to-speech, Text to Speech	Unverified
Synthetic Speaking Children -- Why We Need Them and How to Make Them	Nov 8, 2023	text-to-speech, Text to Speech	Unverified
Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features	Sep 29, 2023	Synthetic Speech Detection, text-to-speech	Unverified
Talking Face Generation with Multilingual TTS	May 13, 2022	Face Generation, Talking Face Generation	Unverified
Talrómur: A large Icelandic TTS corpus	May 1, 2021	text-to-speech, Text to Speech	Unverified
Statistical Context-Dependent Units Boundary Correction for Corpus-based Unit-Selection Text-to-Speech	Mar 5, 2020	Segmentation, text-to-speech	Unverified
Teacher-Student Training for Robust Tacotron-based TTS	Nov 7, 2019	Decoder, Knowledge Distillation	Unverified
Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages	Nov 1, 2022	Chunking, Rhythm	Unverified
Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale	Feb 27, 2025	AI Agent, Large Language Model	Unverified
Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise	Jun 13, 2019	Data Augmentation, Decoder	Unverified
Teochew-Wild: The First In-the-wild Teochew Dataset with Orthographic Annotations	May 8, 2025	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Text-aware and Context-aware Expressive Audiobook Speech Synthesis	Jun 9, 2024	Contrastive Learning, Language Modeling	Unverified
Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS	Jul 13, 2022	Language Modeling, Language Modelling	Unverified
Text-free non-parallel many-to-many voice conversion using normalising flows	Mar 15, 2022	Normalising Flows, Speech Synthesis	Unverified
Text Generation with Speech Synthesis for ASR Data Augmentation	May 22, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis	Mar 27, 2023	All, Automatic Speech Recognition	Unverified
Text Is Not All You Need: Multimodal Prompting Helps LLMs Understand Humor	Dec 1, 2024	All, Natural Language Understanding	Unverified
Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens	Oct 4, 2024	Language Modeling, Language Modelling	Unverified
Text-To-Speech Data Augmentation for Low Resource Speech Recognition	Apr 1, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Text-To-Speech for Languages without an Orthography	Dec 1, 2012	Speech Synthesis, text-to-speech	Unverified
Text-to-Speech for Under-Resourced Languages: Phoneme Mapping and Source Language Selection in Transfer Learning	Jun 1, 2022	Cross-Lingual Transfer, text-to-speech	Unverified
Text-to-Speech Pipeline for Swiss German -- A comparison	May 31, 2023	Speech Synthesis, text-to-speech	Unverified
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder	Dec 16, 2022	Representation Learning, Speech Synthesis	Unverified
Text-To-Speech Synthesis In The Wild	Sep 13, 2024	Benchmarking, Speaker Recognition	Unverified
Textual Echo Cancellation	Aug 13, 2020	Acoustic echo cancellation, speech-recognition	Unverified
The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives	Sep 17, 2024	text-to-speech, Text to Speech	Unverified
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese	May 1, 2012	text-to-speech, Text to Speech	Unverified
The DeepZen Speech Synthesis System for Blizzard Challenge 2023	Aug 30, 2023	Sentence, Speech Synthesis	Unverified
The Effects of Input Type and Pronunciation Dictionary Usage in Transfer Learning for Low-Resource Text-to-Speech	Jun 1, 2023	Cross-Lingual Transfer, text-to-speech	Unverified
The FruitShell French synthesis system at the Blizzard 2023 Challenge	Sep 1, 2023	Data Augmentation, Speech Synthesis	Unverified

