SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 826850 of 1419 papers

TitleStatusHype
ParlamentParla: A Speech Corpus of Catalan Parliamentary Sessions0
Audiobook Dialogues as Training Data for Conversational Style Synthetic Voices0
Using the LARA Little Prince to compare human and TTS audio quality0
Error Annotation in Post-Editing Machine Translation: Investigating the Impact of Text-to-Speech Technology0
Preparing an Endangered Language for the Digital Age: The Case of Judeo-SpanishCode0
StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech SynthesisCode2
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data0
Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning0
QSpeech: Low-Qubit Quantum Speech Application ToolkitCode0
T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation0
PaddleSpeech: An Easy-to-Use All-in-One Speech ToolkitCode6
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-SpeechCode2
Talking Face Generation with Multilingual TTS0
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level QualityCode2
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-SpeechCode1
ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence0
Systematic Inequalities in Language Technology Performance across the World’s LanguagesCode0
Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022Code0
Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss0
LibriS2S: A German-English Speech-to-Speech Translation CorpusCode0
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech SynthesisCode2
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation0
Audio Deep Fake Detection System with Neural Stitching for ADD 20220
Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech0
Study of Indian English Pronunciation Variabilities relative to Received Pronunciation0
Show:102550
← PrevPage 34 of 57Next →

No leaderboard results yet.