Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 676–700 of 1419 papers

Title	Date	Tasks	Status
An overview of text-to-speech systems and media applications	Oct 22, 2023	Acoustic Modellingtext-to-speech	—Unverified
Attentive Multi-Layer Perceptron for Non-autoregressive Generation	Oct 14, 2023	Machine TranslationSpeech Synthesis	CodeCode Available
On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition	Oct 12, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Prosody Analysis of Audiobooks	Oct 10, 2023	AttributeLanguage Modeling	CodeCode Available
Neutral TTS Female Voice Corpus in Brazilian Portuguese	Oct 8, 2023	Speech Synthesistext-to-speech	—Unverified
Unified speech and gesture synthesis using flow matching	Oct 8, 2023	Audio SynthesisMotion Synthesis	—Unverified
Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset	Oct 8, 2023	text-to-speechText to Speech	—Unverified
Latent Filling: Latent Space Data Augmentation for Zero-shot Speech Synthesis	Oct 5, 2023	Data AugmentationSpeech Synthesis	—Unverified
The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains	Oct 4, 2023	Speech Synthesistext-to-speech	—Unverified
Towards human-like spoken dialogue generation between AI agents from written dialogue	Oct 2, 2023	Dialogue Generationtext-to-speech	—Unverified
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS	Sep 29, 2023	Self-Supervised Learningtext-to-speech	—Unverified
Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features	Sep 29, 2023	Synthetic Speech Detectiontext-to-speech	—Unverified
High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models	Sep 27, 2023	AllSpeech Synthesis	—Unverified
Face-StyleSpeech: Enhancing Zero-shot Speech Synthesis from Face Images with Improved Face-to-Speech Mapping	Sep 25, 2023	Speech Synthesistext-to-speech	—Unverified
VoiceLDM: Text-to-Speech with Environmental Context	Sep 24, 2023	AudioCapstext-to-speech	—Unverified
DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis	Sep 22, 2023	DenoisingSpeech Synthesis	—Unverified
The Impact of Silence on Speech Anti-Spoofing	Sep 21, 2023	Action DetectionActivity Detection	—Unverified
Speak While You Think: Streaming Speech Synthesis During Text Generation	Sep 20, 2023	Speech SynthesisText Generation	—Unverified
Exploring Speech Enhancement for Low-resource Speech Synthesis	Sep 19, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition	Sep 19, 2023	Data AugmentationEmotion Recognition	—Unverified
Augmenting text for spoken language understanding with Large Language Models	Sep 17, 2023	Semantic ParsingSpoken Language Understanding	—Unverified
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions	Sep 15, 2023	text-to-speechText to Speech	—Unverified
Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech	Sep 15, 2023	Knowledge DistillationSpeech Synthesis	—Unverified
Direct Text to Speech Translation System using Acoustic Units	Sep 14, 2023	DecoderSpeech-to-Speech Translation	—Unverified
Cross-Utterance Conditioned VAE for Speech Generation	Sep 8, 2023	Speech Synthesistext-to-speech	—Unverified

Show:10 25 50

← PrevPage 28 of 57Next →

No leaderboard results yet.