SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 551600 of 1419 papers

TitleStatusHype
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack0
Augmentation through Laundering Attacks for Audio Spoof Detection0
Data Redaction from Conditional Generative Models0
Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System0
AudioVisual Speech Synthesis: A brief literature review0
Data Efficient Voice Cloning for Neural Singing Synthesis0
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech0
AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style0
Accented Text-to-Speech Synthesis with Limited Data0
GraphTTS: graph-to-sequence modelling in neural text-to-speech0
Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys0
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios0
DASB -- Discrete Audio and Speech Benchmark0
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech0
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue0
Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition0
Cycle-consistency training for end-to-end speech recognition0
Customizing Grapheme-to-Phoneme System for Non-Trivial Transcription Problems in Bangla Language0
AudioJailbreak: Jailbreak Attacks against End-to-End Large Audio-Language Models0
An Algorithm Based on Empirical Methods, for Text-to-Tuneful-Speech Synthesis of Sanskrit Verse0
CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR0
Cued Speech Generation Leveraging a Pre-trained Audiovisual Text-to-Speech Model0
A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI0
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis0
CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder0
An adaptable task-oriented dialog system for stand-alone embedded devices0
Audio Deep Fake Detection System with Neural Stitching for ADD 20220
Crowdsourcing Latin American Spanish for Low-Resource Text-to-Speech0
AMuSeD: An Attentive Deep Neural Network for Multimodal Sarcasm Detection Incorporating Bi-modal Data Augmentation0
Cross-Utterance Conditioned VAE for Speech Generation0
Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data0
Adaptive re-calibration of channel-wise features for Adversarial Audio Classification0
Accent conversion using discrete units with parallel data synthesized from controllable accented TTS0
Bailing-TTS: Chinese Dialectal Speech Synthesis Towards Human-like Spontaneous Representation0
GRASS: Unified Generation Model for Speech-to-Semantic Tasks0
Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance0
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech0
Audiobook Dialogues as Training Data for Conversational Style Synthetic Voices0
CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis0
Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis0
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms0
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge0
Cross-speaker style transfer for text-to-speech using data augmentation0
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation0
Cross-speaker Emotion Transfer by Manipulating Speech Style Latents0
A multilingual training strategy for low resource Text to Speech0
Adapting TTS models For New Speakers using Transfer Learning0
A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models0
Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model0
Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training0
Show:102550
← PrevPage 12 of 29Next →

No leaderboard results yet.