SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Confirm that "ku" is listed by gtts.lang.tts_langs() before relying on it.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the Kurdish text means "Hello, this is my voice in the Kurdish language."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
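The function above opens the saved file with `start`, which exists only on Windows. The following is a minimal sketch of a portable alternative using only the standard library. The helper names `build_open_command` and `open_audio_file` are hypothetical, and the fallback branch assumes a desktop with `xdg-open` available.

```python
import platform
import subprocess


def build_open_command(path, system=None):
    """Return the argument list that opens `path` with the default application."""
    system = system or platform.system()
    if system == "Windows":
        # `start` is a cmd.exe builtin; the empty string fills its window-title slot.
        return ["cmd", "/c", "start", "", path]
    if system == "Darwin":
        return ["open", path]
    # Assumption: any other system is a Linux/BSD desktop with xdg-utils installed.
    return ["xdg-open", path]


def open_audio_file(path):
    """Launch the default player; an argument list avoids shell quoting problems."""
    subprocess.run(build_open_command(path), check=False)
```

Calling `open_audio_file(output_file)` in place of the `os.system(...)` line would keep the rest of the function unchanged.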

Papers

Showing 1301-1350 of 1419 papers

Title | Status | Hype
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech |  | 0
DASB -- Discrete Audio and Speech Benchmark |  | 0
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios |  | 0
Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys |  | 0
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech |  | 0
Data Efficient Voice Cloning for Neural Singing Synthesis |  | 0
Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System |  | 0
Data Redaction from Conditional Generative Models |  | 0
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack |  | 0
Debatts: Zero-Shot Debating Text-to-Speech Synthesis |  | 0
DeepAudio-V1: Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation |  | 0
Deep Denoising Auto-encoder for Statistical Speech Synthesis |  | 0
Deep Feed-forward Sequential Memory Networks for Speech Synthesis |  | 0
Deep Performer: Score-to-Audio Music Performance Synthesis |  | 0
Deep Shallow Fusion for RNN-T Personalization |  | 0
Deep Text-to-Speech System with Seq2Seq Model |  | 0
Deliberation Model for On-Device Spoken Language Understanding |  | 0
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition |  | 0
Denoising Text to Speech with Frame-Level Noise Modeling |  | 0
Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis |  | 0
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control |  | 0
Designing French Tale Corpora for Entertaining Text To Speech Synthesis |  | 0
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention |  | 0
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability |  | 0
Development and Evaluation of Speech Synthesis Corpora for Latvian |  | 0
Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement |  | 0
Development of Marathi Part of Speech Tagger Using Statistical Approach |  | 0
Development of Smartcall Vietnamese Text-to-Speech for VLSP 2020 |  | 0
DeviceTTS: A Small-Footprint, Fast, Stable Network for On-Device Text-to-Speech |  | 0
Diacritization of Maghrebi Arabic Sub-Dialects |  | 0
DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech |  | 0
AutoTTS: End-to-End Text-to-Speech Synthesis through Differentiable Duration Modeling |  | 0
DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs |  | 0
DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles |  | 0
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech |  | 0
DiffVoice: Text-to-Speech with Latent Diffusion |  | 0
Digital Einstein Experience: Fast Text-to-Speech for Conversational AI |  | 0
Direct Speech to Speech Translation: A Review |  | 0
Direct Text to Speech Translation System using Acoustic Units |  | 0
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT |  | 0
Discovering the Italian literature: interactive access to audio indexed text resources |  | 0
DiscreTalk: Text-to-Speech as a Machine Translation Problem |  | 0
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech |  | 0
Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing |  | 0
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization |  | 0
DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction |  | 0
DisfluencySpeech -- Single-Speaker Conversational Speech Dataset with Paralanguage |  | 0
Distribution augmentation for low-resource expressive text-to-speech |  | 0
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis |  | 0
DNN-based Speech Synthesis for Indian Languages from ASCII text |  | 0
Page 27 of 29

No leaderboard results yet.