Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 901–950 of 1419 papers

Title	Date	Tasks	Status
The ILMT-s2s Corpus â€• A Multimodal Interlingual Map Task Corpus	May 1, 2016	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
The Impact of Silence on Speech Anti-Spoofing	Sep 21, 2023	Action DetectionActivity Detection	—Unverified
The MSXF TTS System for ICASSP 2022 ADD Challenge	Jan 27, 2022	text-to-speechText to Speech	—Unverified
The Nós Project: Opening routes for the Galician language in the field of language technologies	Jun 1, 2022	Cultural Vocal Bursts Intensity PredictionMachine Translation	—Unverified
The NTU-AISG Text-to-speech System for Blizzard Challenge 2020	Oct 22, 2020	text-to-speechText to Speech	—Unverified
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance	Apr 11, 2022	Speaker VerificationSpeech Synthesis	—Unverified
The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach	Oct 14, 2019	Expressive Speech SynthesisSociology	—Unverified
The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains	Oct 4, 2023	Speech Synthesistext-to-speech	—Unverified
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge	Apr 9, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain	Jun 3, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality	Apr 27, 2024	Imputationtext-to-speech	—Unverified
T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation	May 24, 2022	DecoderMachine Translation	—Unverified
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion	Apr 6, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Total-Duration-Aware Duration Modeling for Text-to-Speech Systems	Jun 6, 2024	Diversitytext-to-speech	—Unverified
Towards Accurate Lip-to-Speech Synthesis in-the-Wild	Mar 2, 2024	Language ModellingLip to Speech Synthesis	—Unverified
Towards a Japanese Full-duplex Spoken Dialogue System	Jun 3, 2025	Spoken Dialogue Systemstext-to-speech	—Unverified
Towards a Language Service Infrastructure for Mobile Environments	May 1, 2016	text-to-speechText to Speech	—Unverified
Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer	May 15, 2024	Adversarial AttackAutomatic Speech Recognition	—Unverified
Towards Flow-Matching-based TTS without Classifier-Free Guidance	Apr 29, 2025	Speech Synthesistext-to-speech	—Unverified
Towards Fully Automatic Annotation of Audio Books for TTS	May 1, 2012	Speech RecognitionSpeech Synthesis	—Unverified
Towards human-like spoken dialogue generation between AI agents from written dialogue	Oct 2, 2023	Dialogue Generationtext-to-speech	—Unverified
Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement	Jan 15, 2025	Computational EfficiencyCPU	—Unverified
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale	Aug 21, 2022	LipreadingLip Reading	—Unverified
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram	Feb 3, 2021	text-to-speechText to Speech	—Unverified
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion	Oct 16, 2020	Speech Synthesistext-to-speech	—Unverified
Towards Optimizing OCR for Accessibility	Jun 21, 2022	Optical Character Recognition (OCR)text-to-speech	—Unverified
Towards Robust FastSpeech 2 by Modelling Residual Multimodality	Jun 2, 2023	Decodertext-to-speech	—Unverified
Towards Robust Neural Vocoding for Speech Generation: A Survey	Dec 5, 2019	Speech SynthesisSurvey	—Unverified
Towards Selection of Text-to-speech Data to Augment ASR Training	May 30, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis	Aug 31, 2023	Expressive Speech SynthesisSentence	—Unverified
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models	Jun 17, 2019	DecoderSpeech Synthesis	—Unverified
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders	Oct 28, 2022	Speaker Verificationtext-to-speech	—Unverified
Towards Zero-Shot Text-To-Speech for Arabic Dialects	Jun 24, 2024	Dialect IdentificationSpeech Synthesis	—Unverified
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora	Apr 1, 2019	text-to-speechText to Speech	—Unverified
Training Universal Vocoders with Feature Smoothing-Based Augmentation Methods for High-Quality TTS Systems	Sep 4, 2024	text-to-speechText to Speech	—Unverified
Training Wake Word Detection with Synthesized Speech Data on Confusion Words	Nov 3, 2020	Data AugmentationKeyword Spotting	—Unverified
Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation	Jun 9, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction	Nov 6, 2023	text-to-speechText to Speech	—Unverified
Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus	Mar 29, 2022	text-to-speechText to Speech	—Unverified
Transfer the linguistic representations from TTS to accent conversion with non-parallel data	Jan 7, 2024	text-to-speechText to Speech	—Unverified
Transformer-based Models of Text Normalization for Speech Applications	Feb 1, 2022	SentenceSpeech Synthesis	—Unverified
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis	Jul 25, 2022	Data AugmentationSpeech Synthesis	—Unverified
Triple M: A Practical Text-to-speech Synthesis System With Multi-guidance Attention And Multi-band Multi-time LPCNet	Jan 30, 2021	CPUSentence	—Unverified
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder	Jun 30, 2022	Speech Synthesistext-to-speech	—Unverified
TTSDS2: Resources and Benchmark for Evaluating Human-Quality Text to Speech Systems	Jun 24, 2025	text-to-speechText to Speech	—Unverified
TTS for Low Resource Languages: A Bangla Synthesizer	May 1, 2016	Text Normalizationtext-to-speech	—Unverified
TTS-Guided Training for Accent Conversion Without Parallel Data	Dec 20, 2022	Decodertext-to-speech	—Unverified
TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations	Jul 2, 2024	Benchmarkingtext-to-speech	—Unverified
TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer	Jan 10, 2025	speech-recognitionSpeech Recognition	—Unverified
UmbraTTS: Adapting Text-to-Speech to Environmental Contexts with Flow Matching	Jun 11, 2025	Speech Synthesistext-to-speech	—Unverified

Show:10 25 50

← PrevPage 19 of 29Next →

No leaderboard results yet.