Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–175 of 1419 papers

Title	Date	Tasks	Status	Hype
Speech Synthesis along Perceptual Voice Quality Dimensions	Jan 15, 2025	Expressive Speech SynthesisSpeech Synthesis	—Unverified	0
AI-Powered Assistive Technologies for Visual Impairment	Jan 14, 2025	Object Recognitiontext-to-speech	—Unverified	0
MathReader : Text-to-Speech for Mathematical Documents	Jan 13, 2025	Optical Character Recognition (OCR)text-to-speech	CodeCode Available	1
PROEMO: Prompt-Driven Text-to-Speech Synthesis Based on Emotion and Intensity Control	Jan 10, 2025	Speech Synthesistext-to-speech	—Unverified	0
Low-Resource Text-to-Speech Synthesis Using Noise-Augmented Training of ForwardTacotron	Jan 10, 2025	Speech Synthesistext-to-speech	—Unverified	0
TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer	Jan 10, 2025	speech-recognitionSpeech Recognition	—Unverified	0
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction	Jan 10, 2025	Instruction FollowingLanguage Modeling	—Unverified	0
MARS6: A Small and Robust Hierarchical-Codec Text-to-Speech Model	Jan 10, 2025	DecoderLanguage Modelling	—Unverified	0
Probing Speaker-specific Features in Speaker Representations	Jan 9, 2025	Self-Supervised LearningSpeaker Verification	—Unverified	0
Cued Speech Generation Leveraging a Pre-trained Audiovisual Text-to-Speech Model	Jan 8, 2025	text-to-speechText to Speech	—Unverified	0
FaceSpeak: Expressive and High-Quality Speech Synthesis from Human Portraits of Different Styles	Jan 2, 2025	Speech Synthesistext-to-speech	—Unverified	0
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT	Jan 2, 2025	Polyphone disambiguationSentence	—Unverified	0
RingFormer: A Neural Vocoder with Ring Attention and Convolution-Augmented Transformer	Jan 2, 2025	Audio Generationtext-to-speech	CodeCode Available	2
Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting	Dec 28, 2024	Speech Synthesistext-to-speech	—Unverified	0
"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities	Dec 26, 2024	Domain AdaptationLanguage Modeling	CodeCode Available	0
Indonesian-English Code-Switching Speech Synthesizer Utilizing Multilingual STEN-TTS and Bert LID	Dec 26, 2024	Language Identificationtext-to-speech	—Unverified	0
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset	Dec 25, 2024	text-to-speechText to Speech	—Unverified	0
Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective	Dec 22, 2024	text-to-speechText to Speech	—Unverified	0
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis	Dec 22, 2024	DecoderDisentanglement	—Unverified	0
Autoregressive Speech Synthesis with Next-Distribution Prediction	Dec 22, 2024	Language ModelingLanguage Modelling	—Unverified	0
Interleaved Speech-Text Language Models are Simple Streaming Text to Speech Synthesizers	Dec 20, 2024	Language ModelingLanguage Modelling	—Unverified	0
Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling	Dec 19, 2024	AttributeSpeech Enhancement	—Unverified	0
Phoneme-Level Feature Discrepancies: A Key to Detecting Sophisticated Speech Deepfakes	Dec 17, 2024	DeepFake DetectionFace Swapping	—Unverified	0
Enhancing Naturalness in LLM-Generated Utterances through Disfluency Insertion	Dec 17, 2024	text-to-speechText to Speech	—Unverified	0
ProsodyFM: Unsupervised Phrasing and Intonation Control for Intelligible Speech Synthesis	Dec 16, 2024	Speech Synthesistext-to-speech	—Unverified	0

Show:10 25 50

← PrevPage 7 of 57Next →

No leaderboard results yet.