Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1201–1250 of 1419 papers

Title	Date	Tasks	Status
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant	Mar 30, 2025	ChatbotCollaborative Inference	—Unverified
Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction	May 8, 2013	Speech SynthesisSpeech-to-Text	—Unverified
Speech Aware Dialog System Technology Challenge (DSTC11)	Dec 16, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks	Jul 26, 2024	Generative Adversarial NetworkSpeech Enhancement	—Unverified
Speech BERT Embedding For Improving Prosody in Neural TTS	Jun 8, 2021	Decodertext-to-speech	—Unverified
Speech denoising by parametric resynthesis	Apr 2, 2019	DenoisingResynthesis	—Unverified
Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody?	Oct 31, 2024	Rhythmspeech-recognition	—Unverified
Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis	Jul 8, 2025	Data AugmentationMixture-of-Experts	—Unverified
Speech Synthesis along Perceptual Voice Quality Dimensions	Jan 15, 2025	Expressive Speech SynthesisSpeech Synthesis	—Unverified
Speech Synthesis for Low Resource Languages using Transliteration Enabled Transfer Learning	Nov 16, 2021	speech-recognitionSpeech Recognition	—Unverified
Speech Synthesis of Code-Mixed Text	May 1, 2016	Language IdentificationSpeech Synthesis	—Unverified
Speech Synthesis with Mixed Emotions	Aug 11, 2022	AttributeEmotional Speech Synthesis	—Unverified
Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation	May 30, 2025	Language ModelingLanguage Modelling	—Unverified
Speech to Speech Translation with Translatotron: A State of the Art Review	Feb 9, 2025	speech-recognitionSpeech Recognition	—Unverified
Speech to text and text to speech recognition systems-Areview	Mar 17, 2018	speech-recognitionSpeech Recognition	—Unverified
Speech-T: Transducer for Text to Speech and Beyond	Dec 1, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Speech vocoding for laboratory phonology	Jan 22, 2016	Speech Synthesistext-to-speech	—Unverified
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer	Aug 14, 2023	Language ModelingLanguage Modelling	—Unverified
SpMis: An Investigation of Synthetic Spoken Misinformation Detection	Sep 17, 2024	Misinformationtext-to-speech	—Unverified
Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models	Jul 18, 2024	Language ModelingLanguage Modelling	—Unverified
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild	Sep 18, 2024	DeepFake DetectionDiversity	—Unverified
Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech	May 27, 2025	Style Transfertext-to-speech	—Unverified
SQuId: Measuring Speech Naturalness in Many Languages	Oct 12, 2022	Diversitytext-to-speech	—Unverified
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech	Aug 20, 2024	RetrievalSelf-Supervised Learning	—Unverified
Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting	Dec 28, 2024	Speech Synthesistext-to-speech	—Unverified
StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations	Apr 23, 2024	text-to-speechText to Speech	—Unverified
Streaming Non-Autoregressive Model for Accent Conversion and Pronunciation Improvement	Jun 19, 2025	text-to-speechText to Speech	—Unverified
Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation	Feb 4, 2025	Change DetectionGender Classification	—Unverified
StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling	Jun 14, 2025	text-to-speechText to Speech	—Unverified
Structural Analysis of Hindi Phonetics and A Method for Extraction of Phonetically Rich Sentences from a Very Large Hindi Text Corpus	Jan 30, 2017	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Structured State Space Decoder for Speech Recognition and Synthesis	Oct 31, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions	May 30, 2023	AllAutomatic Speech Recognition	—Unverified
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent	Mar 28, 2022	text-to-speechText to Speech	—Unverified
Study of Indian English Pronunciation Variabilities relative to Received Pronunciation	Apr 13, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech	Nov 4, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN	Oct 27, 2023	DecoderDenoising	—Unverified
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models	Oct 6, 2021	text-to-speechText to Speech	—Unverified
StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis	Sep 24, 2024	Speech Synthesistext-to-speech	—Unverified
Style Mixture of Experts for Expressive Text-To-Speech Synthesis	Jun 5, 2024	Mixture-of-ExpertsSpeech Synthesis	—Unverified
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech	Mar 17, 2021	Speech SynthesisStyle Transfer	—Unverified
Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation	Aug 13, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion	Sep 16, 2024	Speech Synthesistext-to-speech	—Unverified
Style Variation as a Vantage Point for Code-Switching	May 1, 2020	Language ModelingLanguage Modelling	—Unverified
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System	Mar 29, 2025	Speech Synthesistext-to-speech	—Unverified
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition	Jun 5, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
SyncSpeech: Low-Latency and Efficient Dual-Stream Text-to-Speech based on Temporal Masked Transformer	Feb 16, 2025	text-to-speechText to Speech	—Unverified
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal	Dec 13, 2020	DiversityRepresentation Learning	—Unverified
Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech	Nov 24, 2020	Data AugmentationSpeaker Recognition	—Unverified
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments	Jul 23, 2024	DiversityKeyword Spotting	—Unverified
SynthASR: Unlocking Synthetic Data for Speech Recognition	Jun 14, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified

Show:10 25 50

← PrevPage 25 of 29Next →

No leaderboard results yet.