SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: gTTS raises ValueError for codes not listed by gtts.lang.tts_langs().
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
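The snippet above opens the saved file with `start`, which only exists on Windows. As a minimal sketch of a portable alternative, the helper below picks an opener command per platform using only the standard library. The names `player_command` and `open_audio` are hypothetical, and the Linux branch assumes `xdg-open` (from xdg-utils) is installed; neither comes from the original snippet.

```python
import subprocess
import sys


def player_command(path, platform=None):
    """Return the argument list that opens `path` with the default application."""
    platform = platform or sys.platform
    if platform.startswith("win"):
        # `start` is a cmd.exe built-in; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a desktop Linux/BSD system with xdg-utils installed.
    return ["xdg-open", path]


def open_audio(path):
    """Launch the default player without building a shell command string."""
    subprocess.run(player_command(path), check=False)
```

Calling `open_audio(output_file)` in place of the `os.system(...)` line would be one way to use it. Passing an argument list rather than a formatted string also avoids problems with file names that contain spaces.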

Papers

Showing 1001-1050 of 1419 papers

| Title | Status | Hype |
| --- | --- | --- |
| Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System | | 0 |
| Data Redaction from Conditional Generative Models | | 0 |
| D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack | | 0 |
| Towards Selection of Text-to-speech Data to Augment ASR Training | | 0 |
| Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis | | 0 |
| Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models | | 0 |
| Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders | | 0 |
| Towards Zero-Shot Text-To-Speech for Arabic Dialects | | 0 |
| Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora | | 0 |
| Training Universal Vocoders with Feature Smoothing-Based Augmentation Methods for High-Quality TTS Systems | | 0 |
| Training Wake Word Detection with Synthesized Speech Data on Confusion Words | | 0 |
| Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation | | 0 |
| Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction | | 0 |
| Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus | | 0 |
| Transfer the linguistic representations from TTS to accent conversion with non-parallel data | | 0 |
| Transformer-based Models of Text Normalization for Speech Applications | | 0 |
| Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis | | 0 |
| Triple M: A Practical Text-to-speech Synthesis System With Multi-guidance Attention And Multi-band Multi-time LPCNet | | 0 |
| TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder | | 0 |
| TTSDS2: Resources and Benchmark for Evaluating Human-Quality Text to Speech Systems | | 0 |
| TTS for Low Resource Languages: A Bangla Synthesizer | | 0 |
| TTS-Guided Training for Accent Conversion Without Parallel Data | | 0 |
| TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations | | 0 |
| TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer | | 0 |
| UmbraTTS: Adapting Text-to-Speech to Environmental Contexts with Flow Matching | | 0 |
| Une aide à la communication par pictogrammes avec prédiction sémantique | | 0 |
| UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation | | 0 |
| Unified speech and gesture synthesis using flow matching | | 0 |
| UniFLG: Unified Facial Landmark Generator from Text or Speech | | 0 |
| Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS) | | 0 |
| UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion | | 0 |
| UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation | | 0 |
| Unsupervised Data Validation Methods for Efficient Model Training | | 0 |
| Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages | | 0 |
| Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis | | 0 |
| Unsupervised Polyglot Text To Speech | | 0 |
| Unsupervised pre-training for sequence to sequence speech recognition | | 0 |
| Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis | | 0 |
| Unsupervised word-level prosody tagging for controllable speech synthesis | | 0 |
| Controllable Speaking Styles Using a Large Language Model | | 0 |
| Using Audio Books for Training a Text-to-Speech System | | 0 |
| Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition | | 0 |
| Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement | | 0 |
| Using previous acoustic context to improve Text-to-Speech synthesis | | 0 |
| Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset | | 0 |
| Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems | | 0 |
| Using the LARA Little Prince to compare human and TTS audio quality | | 0 |
| Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech | | 0 |
| Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction | | 0 |
| Utilizing Speech Emotion Recognition and Recommender Systems for Negative Emotion Handling in Therapy Chatbots | | 0 |
Page 21 of 29

No leaderboard results yet.