SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 376400 of 1419 papers

TitleStatusHype
An In-depth Analysis of the Effect of Text Normalization in Social Media0
Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation0
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-trained BERT0
Direct Text to Speech Translation System using Acoustic Units0
An Implementation of Back-Propagation Learning on GF11, a Large SIMD Parallel Computer0
Voice Impression Control in Zero-Shot TTS0
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment0
Efficient data selection employing Semantic Similarity-based Graph Structures for model training0
Efficiently Trained Low-Resource Mongolian Text-to-Speech System Based On FullConv-TTS0
Discovering the Italian literature: interactive access to audio indexed text resources0
DiscreTalk: Text-to-Speech as a Machine Translation Problem0
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech0
Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing0
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization0
DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction0
DisfluencySpeech -- Single-Speaker Conversational Speech Dataset with Paralanguage0
Distribution augmentation for low-resource expressive text-to-speech0
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers0
DMOSpeech: Direct Metric Optimization via Distilled Diffusion Model in Zero-Shot Speech Synthesis0
Direct Speech to Speech Translation: A Review0
An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS0
Do Prosody Transfer Models Transfer Prosody?0
DPI-TTS: Directional Patch Interaction for Fast-Converging and Style Temporal Modeling in Text-to-Speech0
DPP-TTS: Diversifying prosodic features of speech via determinantal point processes0
Digital Einstein Experience: Fast Text-to-Speech for Conversational AI0
Show:102550
← PrevPage 16 of 57Next →

No leaderboard results yet.