SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Assumes the installed gTTS accepts "ku"; unsupported codes raise ValueError.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
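The snippet above opens the saved file with the Windows `start` command, so it will not play anything on macOS or Linux. A minimal sketch of a portable alternative follows. The helper names `player_command` and `play` are hypothetical and not part of gTTS, and the Linux branch assumes a desktop with `xdg-open` installed.

```python
import subprocess
import sys


def player_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application.

    Hypothetical helper for illustration; not part of gTTS.
    """
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string is its window-title argument.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop where xdg-utils provides `xdg-open`.
    return ["xdg-open", path]


def play(path):
    """Open the audio file without building a shell command string."""
    subprocess.run(player_command(path), check=False)
```

Calling `play("output.mp3")` after `tts.save("output.mp3")` would then replace the `os.system` line. Passing an argument list rather than an f-string also avoids problems with spaces in the file name.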

Papers

Showing 551-600 of 1419 papers

| Title | Status | Hype |
| --- | --- | --- |
| DNN-based Speech Synthesis for Indian Languages from ASCII text | | 0 |
| DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis | | 0 |
| BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data | | 0 |
| Distribution augmentation for low-resource expressive text-to-speech | | 0 |
| DisfluencySpeech -- Single-Speaker Conversational Speech Dataset with Paralanguage | | 0 |
| An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS | | 0 |
| Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters | | 0 |
| DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction | | 0 |
| Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization | | 0 |
| Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | | 0 |
| Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing | | 0 |
| Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech | | 0 |
| Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS | | 0 |
| An In-depth Analysis of the Effect of Text Normalization in Social Media | | 0 |
| DiscreTalk: Text-to-Speech as a Machine Translation Problem | | 0 |
| Discovering the Italian literature: interactive access to audio indexed text resources | | 0 |
| Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT | | 0 |
| Direct Text to Speech Translation System using Acoustic Units | | 0 |
| Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation | | 0 |
| An Implementation of Back-Propagation Learning on GF11, a Large SIMD Parallel Computer | | 0 |
| A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data | | 0 |
| A Challenge Set and Methods for Noun-Verb Ambiguity | | 0 |
| Voice Impression Control in Zero-Shot TTS | | 0 |
| On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition | | 0 |
| Direct Speech to Speech Translation: A Review | | 0 |
| A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers | | 0 |
| Digital Einstein Experience: Fast Text-to-Speech for Conversational AI | | 0 |
| DiffVoice: Text-to-Speech with Latent Diffusion | | 0 |
| An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS | | 0 |
| Diff-TTS: A Denoising Diffusion Model for Text-to-Speech | | 0 |
| AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis | | 0 |
| DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles | | 0 |
| Auto Spell Suggestion for High Quality Speech Synthesis in Hindi | | 0 |
| An Expert System for Automatic Reading of A Text Written in Standard Arabic | | 0 |
| ADEPT: A Dataset for Evaluating Prosody Transfer | | 0 |
| DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs | | 0 |
| Autoregressive Speech Synthesis without Vector Quantization | | 0 |
| AutoTTS: End-to-End Text-to-Speech Synthesis through Differentiable Duration Modeling | | 0 |
| Autoregressive Speech Synthesis with Next-Distribution Prediction | | 0 |
| An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis | | 0 |
| DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech | | 0 |
| Autoregressive Diffusion Transformer for Text-to-Speech Synthesis | | 0 |
| Diacritization of Maghrebi Arabic Sub-Dialects | | 0 |
| AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech | | 0 |
| An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR | | 0 |
| A Deep Generative Acoustic Model for Compositional Automatic Speech Recognition | | 0 |
| DeviceTTS: A Small-Footprint, Fast, Stable Network for On-Device Text-to-Speech | | 0 |
| Development of Smartcall Vietnamese Text-to-Speech for VLSP 2020 | | 0 |
| Automatic Speech Recognition for Hindi | | 0 |
| Development of Marathi Part of Speech Tagger Using Statistical Approach | | 0 |
Page 12 of 29

No leaderboard results yet.