Text to Speech

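from gtts import gTTS
import os


def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: gTTS validates the code and raises ValueError if it is not in its
    # supported list (see gtts.lang.tts_langs()), so "ku" may be rejected.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)


# Example (the string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")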
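
The snippet above opens the saved file with the Windows `start` command, so it will not play on other systems. A minimal sketch of a portable alternative follows, assuming a desktop environment; the helper names `open_command` and `play_file` are hypothetical and not part of gTTS, and the Linux branch assumes `xdg-open` is installed.

```python
import platform
import subprocess


def open_command(path, system=None):
    """Return the command list that opens `path` with the default application."""
    system = system or platform.system()
    if system == "Windows":
        # `start` is a cmd.exe built-in; the empty string fills its window-title slot.
        return ["cmd", "/c", "start", "", path]
    if system == "Darwin":
        # macOS ships the `open` utility.
        return ["open", path]
    # Assumption: a Linux/BSD desktop with xdg-utils available.
    return ["xdg-open", path]


def play_file(path):
    """Open the audio file with the platform's default player."""
    subprocess.run(open_command(path), check=False)
```

Calling `play_file(output_file)` in place of the `os.system(...)` line would keep the rest of the function unchanged.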

Papers

Showing 126–150 of 1419 papers

Title	Date	Tasks	Status	Hype
TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument	Feb 13, 2025	Audio Generation, Decoder	Code Available	2
ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech	Feb 13, 2025	Adversarial Attack, Adversarial Attack Detection	Unverified	0
LoRP-TTS: Low-Rank Personalized Text-To-Speech	Feb 11, 2025	Speech Synthesis, text-to-speech	Unverified	0
Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement	Feb 11, 2025	Disentanglement, text-to-speech	Unverified	0
Synthetic Audio Helps for Cognitive State Tasks	Feb 10, 2025	text-to-speech, Text to Speech	Code Available	0
Speech to Speech Translation with Translatotron: A State of the Art Review	Feb 9, 2025	speech-recognition, Speech Recognition	Unverified	0
Gender Bias in Instruction-Guided Speech Synthesis Models	Feb 8, 2025	Expressive Speech Synthesis, Speech Synthesis	Unverified	0
ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts	Feb 8, 2025	Benchmarking, Self-Supervised Learning	Code Available	1
IndexTTS: An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System	Feb 8, 2025	Decoder, Language Modeling	Code Available	11
Metis: A Foundation Speech Generation Model with Masked Generative Pre-training	Feb 5, 2025	Self-Supervised Learning, Speech Enhancement	Code Available	9
Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech	Feb 5, 2025	Language Modeling, Language Modelling	Unverified	0
Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation	Feb 4, 2025	Change Detection, Gender Classification	Unverified	0
Developing multilingual speech synthesis system for Ojibwe, Mi'kmaq, and Maliseet	Feb 4, 2025	Speech Synthesis, text-to-speech	Code Available	1
EmoTalkingGaussian: Continuous Emotion-conditioned Talking Head Synthesis	Feb 2, 2025	Self-Supervised Learning, SSIM	Unverified	0
VisualSpeech: Enhance Prosody with Visual Context in TTS	Jan 31, 2025	Prosody Prediction, text-to-speech	Unverified	0
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights	Jan 29, 2025	Language Modeling, Language Modelling	Unverified	0
Compact Neural TTS Voices for Accessibility	Jan 28, 2025	Speech Synthesis, text-to-speech	Unverified	0
Overview of the Amphion Toolkit (v0.2)	Jan 26, 2025	text-to-speech, Text to Speech	Code Available	9
Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation	Jan 24, 2025	Audio Deepfake Detection, DeepFake Detection	Unverified	0
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models	Jan 24, 2025	Emotion Classification, Speaker Identification	Unverified	0
LoCoML: A Framework for Real-World ML Inference Pipelines	Jan 24, 2025	Automatic Speech Recognition, Machine Translation	Unverified	0
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement	Jan 23, 2025	Data Augmentation, Speech Enhancement	Unverified	0
Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement	Jan 22, 2025	Object Recognition, speech-recognition	Unverified	0
A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data	Jan 21, 2025	Domain Adaptation, speech-recognition	Unverified	0
Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement	Jan 15, 2025	Computational Efficiency, CPU	Unverified	0

No leaderboard results yet.