SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 9511000 of 1419 papers

TitleStatusHype
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading0
Contextual Expressive Text-to-Speech0
Continual Learning in Machine Speech Chain Using Gradient Episodic Memory0
Continual Speaker Adaptation for Text-to-Speech Synthesis0
Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations0
Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM0
Continuous Speech Synthesis using per-token Latent Diffusion0
Controllable Accented Text-to-Speech Synthesis0
Controllable Emphasis with zero data for text-to-speech0
Controllable neural text-to-speech synthesis using intuitive prosodic features0
Controllable speech synthesis by learning discrete phoneme-level prosodic representations0
Controlling Emotion in Text-to-Speech with Natural Language Prompts0
Controllable Prosody Generation With Partial Inputs0
Controlling Prosody in End-to-End TTS: A Case Study on Contrastive Focus Generation0
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech0
Corpus Generation for Voice Command in Smart Home and the Effect of Speech Synthesis on End-to-End SLU0
Counterfactual Activation Editing for Post-hoc Prosody and Mispronunciation Correction in TTS Models0
Learning Speech Representation From Contrastive Token-Acoustic Pretraining0
Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations0
Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform0
Creating New Voices using Normalizing Flows0
Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT0
Cross-Domain Audio Deepfake Detection: Dataset and Analysis0
Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech0
Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers0
Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario0
Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training0
Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation0
Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model0
Cross-speaker Emotion Transfer by Manipulating Speech Style Latents0
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation0
Cross-speaker style transfer for text-to-speech using data augmentation0
Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis0
CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis0
Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech0
Cross-Utterance Conditioned VAE for Speech Generation0
Crowdsourcing Latin American Spanish for Low-Resource Text-to-Speech0
CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder0
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis0
Cued Speech Generation Leveraging a Pre-trained Audiovisual Text-to-Speech Model0
CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR0
Customizing Grapheme-to-Phoneme System for Non-Trivial Transcription Problems in Bangla Language0
Cycle-consistency training for end-to-end speech recognition0
Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition0
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech0
DASB -- Discrete Audio and Speech Benchmark0
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios0
Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys0
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech0
Data Efficient Voice Cloning for Neural Singing Synthesis0
Show:102550
← PrevPage 20 of 29Next →

No leaderboard results yet.