SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 801850 of 1419 papers

TitleStatusHype
Few-Shot Cross-Lingual TTS Using Transferable Phoneme Embedding0
Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations0
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue0
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech0
Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech0
A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data0
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS0
Towards Optimizing OCR for Accessibility0
Automatic Prosody Annotation with Pre-Trained Text-Speech ModelCode1
NatiQ: An End-to-end Text-to-Speech System for Arabic0
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep LearningCode1
A Novel Chinese Dialect TTS Frontend with Non-Autoregressive Neural Machine Translation0
Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos0
FlexLip: A Controllable Text-to-Lip System0
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE0
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-SpeechCode1
BU-TTS: An Open-Source, Bilingual Welsh-English, Text-to-Speech Corpus0
The Nós Project: Opening routes for the Galician language in the field of language technologies0
Reading Assistance through LARA, the Learning And Reading Assistant0
Exploring Transfer Learning for Urdu Speech Synthesis0
An Open Source Web Reader for Under-Resourced LanguagesCode0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru forSpeech Recognition0
Text-to-Speech for Under-Resourced Languages: Phoneme Mapping and Source Language Selection in Transfer Learning0
Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments0
Investigating Inter- and Intra-speaker Voice Conversion using Audiobooks0
ParlamentParla: A Speech Corpus of Catalan Parliamentary Sessions0
Audiobook Dialogues as Training Data for Conversational Style Synthetic Voices0
Using the LARA Little Prince to compare human and TTS audio quality0
Error Annotation in Post-Editing Machine Translation: Investigating the Impact of Text-to-Speech Technology0
Preparing an Endangered Language for the Digital Age: The Case of Judeo-SpanishCode0
StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech SynthesisCode2
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data0
Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning0
QSpeech: Low-Qubit Quantum Speech Application ToolkitCode0
T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation0
PaddleSpeech: An Easy-to-Use All-in-One Speech ToolkitCode6
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-SpeechCode2
Talking Face Generation with Multilingual TTS0
NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level QualityCode2
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-SpeechCode1
ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence0
Systematic Inequalities in Language Technology Performance across the World’s LanguagesCode0
Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022Code0
Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss0
LibriS2S: A German-English Speech-to-Speech Translation CorpusCode0
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech SynthesisCode2
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation0
Audio Deep Fake Detection System with Neural Stitching for ADD 20220
Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech0
Study of Indian English Pronunciation Variabilities relative to Received Pronunciation0
Show:102550
← PrevPage 17 of 29Next →

No leaderboard results yet.