SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku").
    # gTTS validates `lang` and raises ValueError if "ku" is not among its
    # supported languages; gtts.lang.tts_langs() lists them.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
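The snippet above opens the saved file with `start`, which exists only on Windows. A minimal sketch of a cross-platform alternative follows, assuming `open` is available on macOS and `xdg-open` on Linux desktops. The helper names `open_command` and `open_audio` are hypothetical and not part of gTTS.

```python
import subprocess
import sys


def open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio(path):
    """Open the audio file; returns the launcher's exit code."""
    return subprocess.run(open_command(path), check=False).returncode
```

Passing the path as a list element rather than interpolating it into a shell string also avoids problems with spaces or shell metacharacters in the file name.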

Papers

Showing 376–400 of 1419 papers

| Title | Status | Hype |
| --- | --- | --- |
| ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech | | 0 |
| Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement | | 0 |
| LoRP-TTS: Low-Rank Personalized Text-To-Speech | | 0 |
| Synthetic Audio Helps for Cognitive State Tasks | Code | 0 |
| Speech to Speech Translation with Translatotron: A State of the Art Review | | 0 |
| Gender Bias in Instruction-Guided Speech Synthesis Models | | 0 |
| Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech | | 0 |
| Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation | | 0 |
| EmoTalkingGaussian: Continuous Emotion-conditioned Talking Head Synthesis | | 0 |
| VisualSpeech: Enhance Prosody with Visual Context in TTS | | 0 |
| BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights | | 0 |
| Compact Neural TTS Voices for Accessibility | | 0 |
| Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models | | 0 |
| Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation | | 0 |
| LoCoML: A Framework for Real-World ML Inference Pipelines | | 0 |
| Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement | | 0 |
| Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement | | 0 |
| A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data | | 0 |
| Speech Synthesis along Perceptual Voice Quality Dimensions | | 0 |
| Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement | | 0 |
| AI-Powered Assistive Technologies for Visual Impairment | | 0 |
| MARS6: A Small and Robust Hierarchical-Codec Text-to-Speech Model | | 0 |
| PROEMO: Prompt-Driven Text-to-Speech Synthesis Based on Emotion and Intensity Control | | 0 |
| TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer | | 0 |
| Low-Resource Text-to-Speech Synthesis Using Noise-Augmented Training of ForwardTacotron | | 0 |
Page 16 of 57

No leaderboard results yet.