Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 701–750 of 1419 papers

Title	Date	Tasks	Status
ParlamentParla: A Speech Corpus of Catalan Parliamentary Sessions	Jun 1, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations	Mar 1, 2023	Self-Supervised LearningSpeech Synthesis	—Unverified
PauseSpeech: Natural Speech Synthesis via Pre-trained Language Model and Pause-based Prosody Modeling	Jun 13, 2023	Language ModelingLanguage Modelling	—Unverified
Penambahan emosi menggunakan metode manipulasi prosodi untuk sistem text to speech bahasa Indonesia	Jun 29, 2016	Sentencetext-to-speech	—Unverified
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech	Nov 2, 2020	Knowledge DistillationSpeech Synthesis	—Unverified
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis	Oct 28, 2022	DecoderDiversity	—Unverified
Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice	Jun 14, 2024	text-to-speechText to Speech	—Unverified
Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes	Dec 17, 2024	DeepFake DetectionFace Swapping	—Unverified
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis	Jun 4, 2024	In-Context LearningLanguage Modeling	—Unverified
Phonikud: Hebrew Grapheme-to-Phoneme Conversion for Real-Time Text-to-Speech	Jun 14, 2025	Grapheme-to-Phoneme Conversiontext-to-speech	—Unverified
Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end	Jan 24, 2022	Morphological AnalysisPolyphone disambiguation	—Unverified
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features	Jul 3, 2019	Polyphone disambiguationSentence	—Unverified
Positional Description for Numerical Normalization	Aug 22, 2024	speech-recognitionSpeech Recognition	—Unverified
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar	Oct 13, 2022	text-to-speechText to Speech	—Unverified
PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction	Jun 18, 2025	Sentencetext-to-speech	—Unverified
Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis	Aug 4, 2018	Speech Synthesistext-to-speech	—Unverified
Preference Alignment Improves Language Model-Based TTS	Sep 19, 2024	Language ModelingLanguage Modelling	—Unverified
Prior-agnostic Multi-scale Contrastive Text-Audio Pre-training for Parallelized TTS Frontend Modeling	Apr 14, 2024	Polyphone disambiguationText Normalization	—Unverified
Probing Deep Speaker Embeddings for Speaker-related Tasks	Dec 14, 2022	Speaker RecognitionSpeaker Verification	—Unverified
Probing Speaker-specific Features in Speaker Representations	Jan 9, 2025	Self-Supervised LearningSpeaker Verification	—Unverified
PROEMO: Prompt-Driven Text-to-Speech Synthesis Based on Emotion and Intensity Control	Jan 10, 2025	Speech Synthesistext-to-speech	—Unverified
PSCodec: A Series of High-Fidelity Low-bitrate Neural Speech Codecs Leveraging Prompt Encoders	Apr 3, 2024	Representation LearningSpeaker Verification	—Unverified
PromptTTS 2: Describing and Generating Voices with Text Prompt	Sep 5, 2023	Language ModellingLarge Language Model	—Unverified
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions	Sep 15, 2023	text-to-speechText to Speech	—Unverified
Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions	Jun 3, 2025	Expressive Speech SynthesisPrompt Learning	—Unverified
Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis	Nov 19, 2021	ClusteringDecoder	—Unverified
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech	Nov 4, 2020	Graph AttentionRepresentation Learning	—Unverified
Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech	Jun 24, 2022	text-to-speechText to Speech	—Unverified
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis	Dec 16, 2024	Speech Synthesistext-to-speech	—Unverified
Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features	Nov 21, 2019	text-to-speechText to Speech	—Unverified
Prosody-TTS: An end-to-end speech synthesis system with prosody control	Oct 6, 2021	RhythmSpeech Synthesis	—Unverified
ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech	Feb 16, 2022	text-to-speechText to Speech	—Unverified
The Zero Resource Speech Challenge 2019: TTS without T	Apr 25, 2019	text-to-speechText to Speech	—Unverified
From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories	Aug 20, 2019	RetrievalTAG	—Unverified
On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition	Jul 31, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Handling Numeric Expressions in Automatic Speech Recognition	Jul 18, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation	Aug 1, 2024	Representation LearningSpeech Synthesis	—Unverified
Enhancing Kurdish Text-to-Speech with Native Corpus Training: A High-Quality WaveGlow Vocoder Approach	Sep 10, 2024	Speech Synthesistext-to-speech	—Unverified
UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech	May 15, 2025	Emotional Speech SynthesisLanguage Modeling	—Unverified
Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese	May 16, 2025	BenchmarkingLanguage Modeling	—Unverified
Voice Impression Control in Zero-Shot TTS	Jun 6, 2025	Language ModelingLanguage Modelling	—Unverified
Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs	Jun 12, 2025	Speech-to-Speech Translationtext-to-speech	—Unverified
AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge	Aug 30, 2024	DeepFake DetectionFace Swapping	—Unverified
A Bengali HMM Based Speech Synthesis System	Jun 16, 2014	Speech Synthesistext-to-speech	—Unverified
Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling	May 26, 2025	GPUtext-to-speech	—Unverified
AccentBox: Towards High-Fidelity Zero-Shot Accent Generation	Sep 13, 2024	text-to-speechText to Speech	—Unverified
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training	Jun 3, 2024	Speech Synthesistext-to-speech	—Unverified
Accent conversion using discrete units with parallel data synthesized from controllable accented TTS	Sep 30, 2024	Data AugmentationSpeech Synthesis	—Unverified
Accented Text-to-Speech Synthesis with Limited Data	May 8, 2023	Speech Synthesistext-to-speech	—Unverified
A Challenge Set and Methods for Noun-Verb Ambiguity	Oct 1, 2018	Speech Synthesistext-to-speech	—Unverified

Show:10 25 50

← PrevPage 15 of 29Next →

No leaderboard results yet.