SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selected for Kurdish).
    # Note: "ku" may not be among gTTS's supported languages; check gtts.lang.tts_langs().
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (the text means "Hello, this is my voice in Kurdish."):
# text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
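The snippet above opens the saved file with `start`, which only exists on Windows. As a minimal sketch of a portable alternative, the helper below picks an opener command per platform. The function names `open_command` and `open_file` are hypothetical, and the fallback assumes a desktop system with `xdg-open` installed.

```python
import platform
import subprocess


def open_command(path, system=None):
    """Return the argument list that opens `path` with the default application."""
    system = system or platform.system()
    if system == "Windows":
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if system == "Darwin":
        return ["open", path]
    # Assumption: a Linux/BSD desktop where xdg-utils provides `xdg-open`.
    return ["xdg-open", path]


def open_file(path):
    """Open `path` using an argument list rather than an interpolated shell string."""
    subprocess.run(open_command(path), check=False)


# Show the command that would be used for the generated audio on macOS.
print(open_command("output.mp3", system="Darwin"))
```

Passing an argument list instead of an f-string to a shell also avoids problems with file names that contain spaces.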

Papers

Showing 901-950 of 1419 papers

Title | Status | Hype
Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech2 | | 0
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights | | 0
Bridging the Gap: An Intermediate Language for Enhanced and Cost-Effective Grapheme-to-Phoneme Conversion with Homographs with Multiple Pronunciations Disambiguation | | 0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text | | 0
BUCEADOR, a multi-language search engine for digital libraries | | 0
Building a Luganda Text-to-Speech Model From Crowdsourced Data | | 0
Building a mixed-lingual neural TTS system with only monolingual data | | 0
Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis | | 0
Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech | | 0
Building Open-source Speech Technology for Low-resource Minority Languages with Sámi as an Example – Tools, Methods and Experiments | | 0
Building Synthetic Speaker Profiles in Text-to-Speech Systems | | 0
Building Text-to-Speech Systems for Resource Poor Languages | | 0
Building Text-To-Speech Voices in the Cloud | | 0
Bunched LPCNet2: Efficient Neural Vocoders Covering Devices from Cloud to Edge | | 0
Bunched LPCNet: Vocoder for Low-cost Neural Text-To-Speech Systems | | 0
Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech | | 0
BU-TTS: An Open-Source, Bilingual Welsh-English, Text-to-Speech Corpus | | 0
Can DeepFake Speech be Reliably Detected? | | 0
Can Emotion Fool Anti-spoofing? | | 0
Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? | | 0
Can we reconstruct a dysarthric voice with the large speech model Parler TTS? | | 0
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data | | 0
CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech | | 0
CASSANDRA: A multipurpose configurable voice-enabled human-computer-interface | | 0
Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | | 0
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models | | 0
Character-Level Bangla Text-to-IPA Transcription Using Transformer Architecture with Sequence Alignment | | 0
ChatAnything: Facetime Chat with LLM-Enhanced Personas | | 0
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network | | 0
CHULA TTS: A Modularized Text-To-Speech Framework | | 0
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech | | 0
ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus | | 0
Clip-TTS: Contrastive Text-content and Mel-spectrogram, A High-Quality Text-to-Speech Method based on Contextual Semantic Understanding | | 0
CloneShield: A Framework for Universal Perturbation Against Zero-Shot Voice Cloning | | 0
CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages | | 0
Code-Mixed Text to Speech Synthesis under Low-Resource Constraints | | 0
Code-Switching Text Generation and Injection in Mandarin-English ASR | | 0
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling | | 0
Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection | | 0
Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis | | 0
ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios | | 0
Compact Neural TTS Voices for Accessibility | | 0
Comparative Analysis of Transfer Learning in Deep Learning Text-to-Speech Models on a Few-Shot, Low-Resource, Customized Dataset | | 0
Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech | | 0
Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures | | 0
Comparison of Grapheme-to-Phoneme Conversion Methods on a Myanmar Pronunciation Dictionary | | 0
Comparison of Speech Representations for the MOS Prediction System | | 0
Compress Polyphone Pronunciation Prediction Model with Shared Labels | | 0
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need | | 0
Conditioning Sequence-to-sequence Networks with Learned Activations | | 0
Page 19 of 29

No leaderboard results yet.