Text to Speech

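from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku").
    # Assumption: the installed gTTS release accepts "ku";
    # gtts.lang.tts_langs() lists the codes it supports.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    # Open the audio file with the default player (Windows only).
    os.system(f'start "" "{output_file}"')

# Example (the text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")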
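
The snippet above opens the saved file with the Windows `start` command, so playback only works on Windows. As a rough sketch of one way to make that step portable, the helper below picks an opener command by platform. The names `build_open_command` and `open_audio` are hypothetical and are not part of gTTS; the Linux branch assumes `xdg-open` is installed.

```python
import os
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application.

    Hypothetical helper for illustration; not part of gTTS.
    """
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string is its window-title argument.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio(path):
    """Open an existing audio file with the system's default player."""
    if not os.path.exists(path):
        raise FileNotFoundError(path)
    subprocess.run(build_open_command(path), check=False)
```

Calling `open_audio("output.mp3")` after `tts.save("output.mp3")` would then take the place of the `os.system` line.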

Papers

Showing 651–675 of 1419 papers

Title	Date	Tasks	Status
Boosting Large Language Model for Speech Synthesis: An Empirical Study	Dec 30, 2023	Language Modeling, Language Modelling	Unverified
Normalization of Lithuanian Text Using Regular Expressions	Dec 29, 2023	Speech Synthesis, Text Normalization	Unverified
AE-Flow: AutoEncoder Normalizing Flow	Dec 27, 2023	text-to-speech, Text to Speech	Unverified
Creating New Voices using Normalizing Flows	Dec 22, 2023	Speech Synthesis, text-to-speech	Unverified
External Knowledge Augmented Polyphone Disambiguation Using Large Language Model	Dec 19, 2023	Decoder, Language Modeling	Unverified
A review-based study on different Text-to-Speech technologies	Dec 17, 2023	text-to-speech, Text to Speech	Unverified
MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis	Dec 17, 2023	Speech Synthesis, Style Transfer	Unverified
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis	Dec 8, 2023	Benchmarking, Quantization	Unverified
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis	Dec 6, 2023	Speech Synthesis, text-to-speech	Unverified
Rapid Speaker Adaptation in Low Resource Text to Speech Systems using Synthetic Data and Transfer learning	Dec 2, 2023	Decoder, text-to-speech	Unverified
Code-Mixed Text to Speech Synthesis under Low-Resource Constraints	Dec 2, 2023	Speech Synthesis, text-to-speech	Unverified
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes	Nov 29, 2023	Face Recognition, Face Swapping	Unverified
Guided Flows for Generative Modeling and Decision Making	Nov 22, 2023	Conditional Image Generation, Decision Making	Unverified
Utilizing Speech Emotion Recognition and Recommender Systems for Negative Emotion Handling in Therapy Chatbots	Nov 18, 2023	Chatbot, Emotion Recognition	Unverified
Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys	Nov 18, 2023	text-to-speech, Text to Speech	Unverified
A Study on Altering the Latent Space of Pretrained Text to Speech Models for Improved Expressiveness	Nov 17, 2023	text-to-speech, Text to Speech	Unverified
ChatAnything: Facetime Chat with LLM-Enhanced Personas	Nov 12, 2023	Image Generation, In-Context Learning	Unverified
Synthetic Speaking Children -- Why We Need Them and How to Make Them	Nov 8, 2023	text-to-speech, Text to Speech	Unverified
Character-Level Bangla Text-to-IPA Transcription Using Transformer Architecture with Sequence Alignment	Nov 7, 2023	Decoder, Position	Unverified
Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction	Nov 6, 2023	text-to-speech, Text to Speech	Unverified
E3 TTS: Easy End-to-End Diffusion-based Text to Speech	Nov 2, 2023	text-to-speech, Text to Speech	Unverified
Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations	Nov 2, 2023	Language Modeling, Language Modelling	Unverified
Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN	Oct 27, 2023	Decoder, Denoising	Unverified
Generative Pre-training for Speech with Flow Matching	Oct 25, 2023	Speech Enhancement, Speech Synthesis	Unverified
DPP-TTS: Diversifying prosodic features of speech via determinantal point processes	Oct 23, 2023	Diversity, Point Processes	Unverified

No leaderboard results yet.