SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 301350 of 1419 papers

TitleStatusHype
Can Emotion Fool Anti-spoofing?0
A Framework for Synthetic Audio Conversations Generation using Large Language Models0
Can DeepFake Speech be Reliably Detected?0
BU-TTS: An Open-Source, Bilingual Welsh-English, Text-to-Speech Corpus0
Applying Syntaxx2013Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis0
AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge0
Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech0
Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems0
AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis0
Bunched LPCNet2: Efficient Neural Vocoders Covering Devices from Cloud to Edge0
Building Text-To-Speech Voices in the Cloud0
Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech0
A Context-Based Numerical Format Prediction for a Text-To-Speech System0
Building Text-to-Speech Systems for Resource Poor Languages0
Building Synthetic Speaker Profiles in Text-to-Speech Systems0
Applying Automated Machine Translation to Educational Video Courses0
Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments0
Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech0
Application of ASV for Voice Identification after VC and Duration Predictor Improvement in TTS Models0
AE-Flow: AutoEncoder Normalizing Flow0
Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis0
Building a mixed-lingual neural TTS system with only monolingual data0
A Polyphone BERT for Polyphone Disambiguation in Mandarin Chinese0
Building a Luganda Text-to-Speech Model From Crowdsourced Data0
基於字元階層之語音合成用文脈訊息擷取(Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese]0
台語古詩朗誦系統A Taiwanese Text-to-Speech System for Ancient Poems[In Chinese]0
DiffVoice: Text-to-Speech with Latent Diffusion0
Direct Speech to Speech Translation: A Review0
BUCEADOR, a multi-language search engine for digital libraries0
基於字元階層之語音合成用文脈訊息擷取 (Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese]0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text0
Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models0
Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting0
Bridging the Gap: An Intermediate Language for Enhanced and Cost-Effective Grapheme-to-Phoneme Conversion with Homographs with Multiple Pronunciations Disambiguation0
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights0
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person0
Braille-to-Speech Generator: Audio Generation Based on Joint Fine-Tuning of CLIP and Fastspeech20
Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech0
Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck0
Adversarial speech for voice privacy protection from Personalized Speech generation0
A Comparative Analysis of Pretrained Language Models for Text-to-Speech0
Bootstrap an end-to-end ASR system by multilingual training, transfer learning, text-to-text mapping and synthetic audio0
Boosting Large Language Model for Speech Synthesis: An Empirical Study0
An overview of text-to-speech systems and media applications0
Boosting Diffusion Model for Spectrogram Up-sampling in Text-to-speech: An Empirical Study0
BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization0
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era0
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech0
BiVocoder: A Bidirectional Neural Vocoder Integrating Feature Extraction and Waveform Generation0
BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing0
Show:102550
← PrevPage 7 of 29Next →

No leaderboard results yet.