SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (the language code "ku" selects Kurdish).
    # Assumes the installed gTTS version accepts "ku"; gtts.lang.tts_langs() lists the supported codes.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (the text reads: "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
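The snippet above opens the saved file with the Windows-only `start` command. A minimal sketch of a more portable way to open it, using only the standard library, might look like the following. The helper names `build_open_command` and `open_audio_file` are hypothetical, and the fallback assumes a Linux-like desktop where `xdg-open` is available.

```python
import os
import subprocess
import sys


def build_open_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application.

    Returns None on Windows, where os.startfile is used instead of a command.
    """
    if platform.startswith("win"):
        return None
    if platform == "darwin":
        return ["open", path]
    # Assumption: any other platform is a Linux-like desktop providing xdg-open.
    return ["xdg-open", path]


def open_audio_file(path):
    """Open an audio file with the system's default player."""
    command = build_open_command(path)
    if command is None:
        os.startfile(path)  # Available on Windows only
    else:
        # Passing a list avoids shell quoting problems with unusual file names.
        subprocess.run(command, check=False)
```

Calling `open_audio_file(output_file)` in place of the `os.system(...)` line would keep the rest of the function unchanged.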

Papers

Showing 901–950 of 1419 papers

| Title | Status | Hype |
| --- | --- | --- |
| Expressive, Variable, and Controllable Duration Modelling in TTS | | 0 |
| Few-Shot Cross-Lingual TTS Using Transferable Phoneme Embedding | | 0 |
| Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations | | 0 |
| End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue | | 0 |
| Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech | | 0 |
| SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech | | 0 |
| A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data | | 0 |
| Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS | | 0 |
| Towards Optimizing OCR for Accessibility | | 0 |
| NatiQ: An End-to-end Text-to-Speech System for Arabic | | 0 |
| A Novel Chinese Dialect TTS Frontend with Non-Autoregressive Neural Machine Translation | | 0 |
| Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos | | 0 |
| FlexLip: A Controllable Text-to-Lip System | | 0 |
| Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE | | 0 |
| Audiobook Dialogues as Training Data for Conversational Style Synthetic Voices | | 0 |
| Error Annotation in Post-Editing Machine Translation: Investigating the Impact of Text-to-Speech Technology | | 0 |
| Exploring Transfer Learning for Urdu Speech Synthesis | | 0 |
| Text-to-Speech for Under-Resourced Languages: Phoneme Mapping and Source Language Selection in Transfer Learning | | 0 |
| The Nós Project: Opening routes for the Galician language in the field of language technologies | | 0 |
| An Open Source Web Reader for Under-Resourced Languages | Code | 0 |
| Reading Assistance through LARA, the Learning And Reading Assistant | | 0 |
| Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition | | 0 |
| BU-TTS: An Open-Source, Bilingual Welsh-English, Text-to-Speech Corpus | | 0 |
| Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments | | 0 |
| Investigating Inter- and Intra-speaker Voice Conversion using Audiobooks | | 0 |
| Using the LARA Little Prince to compare human and TTS audio quality | | 0 |
| ParlamentParla: A Speech Corpus of Catalan Parliamentary Sessions | | 0 |
| Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish | Code | 0 |
| Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data | | 0 |
| Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning | | 0 |
| QSpeech: Low-Qubit Quantum Speech Application Toolkit | Code | 0 |
| T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation | | 0 |
| Talking Face Generation with Multilingual TTS | | 0 |
| ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence | | 0 |
| Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022 | Code | 0 |
| Systematic Inequalities in Language Technology Performance across the World’s Languages | Code | 0 |
| Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss | | 0 |
| LibriS2S: A German-English Speech-to-Speech Translation Corpus | Code | 0 |
| Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation | | 0 |
| Audio Deep Fake Detection System with Neural Stitching for ADD 2022 | | 0 |
| Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech | | 0 |
| Study of Indian English Pronunciation Variabilities relative to Received Pronunciation | | 0 |
| Enhancement of Pitch Controllability using Timbre-Preserving Pitch Augmentation in FastPitch | | 0 |
| Fine-grained Noise Control for Multispeaker Speech Synthesis | | 0 |
| The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance | | 0 |
| Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech | | 0 |
| Karaoker: Alignment-free singing voice synthesis with speech training data | | 0 |
| Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis | | 0 |
| Arabic Text-To-Speech (TTS) Data Preparation | | 0 |
| Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification | | 0 |

No leaderboard results yet.