SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 651675 of 1419 papers

TitleStatusHype
HLTCOE JHU Submission to the Voice Privacy Challenge 20240
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement0
HMM-based data augmentation for E2E systems for building conversational speech synthesis systems0
Creating New Voices using Normalizing Flows0
Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech0
Human Detection of Political Speech Deepfakes across Transcripts, Audio, and Video0
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition0
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder0
HybridNet: A Hybrid Neural Architecture to Speed-up Autoregressive Models0
Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform0
Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis0
Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations0
Attempt Towards Stress Transfer in Speech-to-Speech Machine Translation0
Impact of Frame Rates on Speech Tokenizer: A Case Study on Mandarin and English0
Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data0
A Melody-Unsupervision Model for Singing Voice Synthesis0
Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis0
Improve few-shot voice cloning using multi-modal learning0
Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech0
Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems0
Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model0
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation0
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model0
Generating Rich Product Descriptions for Conversational E-commerce Systems0
Show:102550
← PrevPage 27 of 57Next →

No leaderboard results yet.