SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # gTTS rejects language codes it does not support, so confirm "ku" is available first.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (the text reads "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
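The snippet above opens the generated file with the Windows-only `start` command. As a minimal sketch of a more portable alternative, the helper below picks an opener per platform. The function names `build_open_command` and `open_audio_file` are hypothetical, and the sketch assumes that `open` exists on macOS and that `xdg-open` is installed on Linux desktops.

```python
import os
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application.

    Returns None on Windows, where os.startfile is used instead of a command.
    """
    if platform.startswith("win"):
        return None
    if platform == "darwin":
        return ["open", path]
    # Assumption: a freedesktop-style desktop with xdg-open installed.
    return ["xdg-open", path]


def open_audio_file(path):
    """Open an audio file with the system's default player."""
    command = build_open_command(path)
    if command is None:
        os.startfile(path)  # Available on Windows only
    else:
        # Passing a list avoids shell quoting problems with unusual file names.
        subprocess.run(command, check=False)
```

Calling `open_audio_file("output.mp3")` in place of the `os.system` line would keep the rest of the function unchanged.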

Papers

Showing 951-1000 of 1419 papers

Title | Status | Hype
Une aide à la communication par pictogrammes avec prédiction sémantique | | 0
UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation | | 0
Unified speech and gesture synthesis using flow matching | | 0
UniFLG: Unified Facial Landmark Generator from Text or Speech | | 0
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS) | | 0
UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion | | 0
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation | | 0
Unsupervised Data Validation Methods for Efficient Model Training | | 0
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages | | 0
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis | | 0
Unsupervised Polyglot Text To Speech | | 0
Unsupervised pre-training for sequence to sequence speech recognition | | 0
Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis | | 0
Unsupervised word-level prosody tagging for controllable speech synthesis | | 0
Controllable Speaking Styles Using a Large Language Model | | 0
Using Audio Books for Training a Text-to-Speech System | | 0
Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition | | 0
Using IPA-Based Tacotron for Data Efficient Cross-Lingual Speaker Adaptation and Pronunciation Enhancement | | 0
Using previous acoustic context to improve Text-to-Speech synthesis | | 0
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset | | 0
Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems | | 0
Using the LARA Little Prince to compare human and TTS audio quality | | 0
Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech | | 0
Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction | | 0
Utilizing Speech Emotion Recognition and Recommender Systems for Negative Emotion Handling in Therapy Chatbots | | 0
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE | | 0
UzbekTagger: The rule-based POS tagger for Uzbek language | | 0
VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages | | 0
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers | | 0
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment | | 0
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech | | 0
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention | | 0
可變速中文文字轉語音系統 (Variable Speech Rate Mandarin Chinese Text-to-Speech System) [In Chinese] | | 0
Varianceflow: High-Quality and Controllable Text-to-Speech using Variance Information via Normalizing Flow | | 0
VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech | | 0
Vers une annotation automatique de corpus audio pour la synthèse de parole (Towards Fully Automatic Annotation of Audio Books for Text-To-Speech (TTS) Synthesis) [in French] | | 0
Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement | | 0
ViDA-MAN: Visual Dialog with Digital Humans | | 0
Vietnamese Text-To-Speech Shared Task VLSP 2020: Remaining problems with state-of-the-art techniques | | 0
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation | | 0
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech | | 0
Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis | | 0
Visual-Aware Text-to-Speech | | 0
VisualSpeech: Enhance Prosody with Visual Context in TTS | | 0
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over | | 0
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer | | 0
Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise | | 0
VocalEyes: Enhancing Environmental Perception for the Visually Impaired through Vision-Language Models and Distance-Aware Object Detection | | 0
Voice-Assisted Real-Time Traffic Sign Recognition System Using Convolutional Neural Network | | 0
Voice Builder: A Tool for Building Text-To-Speech Voices | | 0
Page 20 of 29

No leaderboard results yet.