SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 651675 of 1419 papers

TitleStatusHype
IMaSC -- ICFOSS Malayalam Speech Corpus0
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech0
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue0
HybridNet: A Hybrid Neural Architecture to Speed-up Autoregressive Models0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru forSpeech Recognition0
Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition0
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS0
Cycle-consistency training for end-to-end speech recognition0
Human Detection of Political Speech Deepfakes across Transcripts, Audio, and Video0
Customizing Grapheme-to-Phoneme System for Non-Trivial Transcription Problems in Bangla Language0
AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models0
An Algorithm Based on Empirical Methods, for Text-to-Tuneful-Speech Synthesis of Sanskrit Verse0
HMM-based data augmentation for E2E systems for building conversational speech synthesis systems0
Impact of Frame Rates on Speech Tokenizer: A Case Study on Mandarin and English0
Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data0
CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR0
HLTCOE JHU Submission to the Voice Privacy Challenge 20240
Improve few-shot voice cloning using multi-modal learning0
Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech0
Cued Speech Generation Leveraging a Pre-trained Audiovisual Text-to-Speech Model0
Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model0
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation0
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model0
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI0
Show:102550
← PrevPage 27 of 57Next →

No leaderboard results yet.