Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: gTTS raises ValueError if "ku" is not among its supported languages.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
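The snippet above opens the saved file with `start`, which only exists on Windows. As a minimal sketch of a more portable approach, the helper below picks an opener command per platform. The names `build_open_command` and `open_file` are hypothetical, and the fallback to `xdg-open` assumes a Linux-like desktop where that tool is installed.

```python
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe built-in; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        # macOS ships the `open` command.
        return ["open", path]
    # Assumption: any other platform is a Linux-like desktop with xdg-open.
    return ["xdg-open", path]


def open_file(path):
    """Launch the default player for `path` without raising on a non-zero exit."""
    subprocess.run(build_open_command(path), check=False)
```

Passing the path as a separate list element, rather than interpolating it into a shell string, also avoids problems with spaces in file names.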

Papers

Showing 626–650 of 1419 papers

Title	Date	Tasks	Status
Deep Shallow Fusion for RNN-T Personalization	Nov 16, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Deep Performer: Score-to-Audio Music Performance Synthesis	Feb 12, 2022	Decoder, Speech Synthesis	Unverified
A Unified Framework for Collecting Text-to-Speech Synthesis Datasets for 22 Indian Languages	Oct 18, 2024	Speech Synthesis, text-to-speech	Unverified
Deep Feed-forward Sequential Memory Networks for Speech Synthesis	Feb 26, 2018	speech-recognition, Speech Recognition	Unverified
Augmenting text for spoken language understanding with Large Language Models	Sep 17, 2023	Semantic Parsing, Spoken Language Understanding	Unverified
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments	Jul 14, 2025	Speech-to-Text, text-to-speech	Unverified
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios	Apr 1, 2022	Speech Synthesis, text-to-speech	Unverified
Deep Denoising Auto-encoder for Statistical Speech Synthesis	Jun 17, 2015	Denoising, Speech Synthesis	Unverified
DeepAudio-V1: Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation	Mar 28, 2025	Audio Generation, Audio-Visual Synchronization	Unverified
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework	Nov 4, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Debatts: Zero-Shot Debating Text-to-Speech Synthesis	Nov 10, 2024	Speech Synthesis, text-to-speech	Unverified
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack	Sep 11, 2024	Adversarial Attack, Audio Synthesis	Unverified
Augmentation through Laundering Attacks for Audio Spoof Detection	Oct 1, 2024	Data Augmentation, Face Swapping	Unverified
Data Redaction from Conditional Generative Models	May 18, 2023	text-to-speech, Text to Speech	Unverified
Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System	Apr 20, 2020	text-to-speech, Text to Speech	Unverified
Data Efficient Voice Cloning for Neural Singing Synthesis	Feb 19, 2019	text-to-speech, Text to Speech	Unverified
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech	Jan 19, 2024	Self-Supervised Learning, text-to-speech	Unverified
AudioVisual Speech Synthesis: A brief literature review	Feb 18, 2021	Speech Synthesis, text-to-speech	Unverified
AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style	Jul 6, 2021	Decoder, Mixture-of-Experts	Unverified
Accented Text-to-Speech Synthesis with Limited Data	May 8, 2023	Speech Synthesis, text-to-speech	Unverified
Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys	Nov 18, 2023	text-to-speech, Text to Speech	Unverified
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios	Jun 7, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
DASB -- Discrete Audio and Speech Benchmark	Jun 20, 2024	Benchmarking, Emotion Recognition	Unverified
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech	Oct 17, 2024	Disentanglement, Quantization	Unverified
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue	Dec 7, 2022	Spoken Dialogue Systems, text-to-speech	Unverified
