SOTAVerified

Text to Speech

    from gtts import gTTS
    import os

    def text_to_speech_kurdish(text, output_file="output.mp3"):
        # Convert text to speech in Kurdish (language code "ku").
        # gTTS raises ValueError for language codes it does not support,
        # so check that "ku" is available in your installed version.
        tts = gTTS(text=text, lang='ku', slow=False)
        tts.save(output_file)
        os.system(f"start {output_file}")  # Open the audio file (Windows only)

    # Example (the text means "Hello, this is my voice in Kurdish."):
    text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
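The snippet above opens the saved file with the `start` shell command, which exists only on Windows. A minimal sketch of a more portable playback step follows, using only the standard library. The helper names `open_command` and `play_file` are hypothetical, and the Linux branch assumes a desktop with `xdg-open` installed.

```python
import os
import subprocess
import sys


def open_command(path, platform=sys.platform):
    """Return the argv list that opens `path`, or None on Windows."""
    if platform.startswith("win"):
        return None  # Windows: handled by os.startfile in play_file
    if platform == "darwin":
        return ["open", path]  # macOS default opener
    return ["xdg-open", path]  # assumption: Linux desktop with xdg-utils


def play_file(path):
    """Open an audio file with the system default player."""
    cmd = open_command(path)
    if cmd is None:
        os.startfile(path)  # available on Windows only
    else:
        subprocess.run(cmd, check=False)
```

Calling `play_file(output_file)` in place of the `os.system(...)` line would keep the rest of the function unchanged. Passing the path as a list element also avoids shell quoting problems with file names that contain spaces.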

Papers

Showing 601-650 of 1419 papers

| Title | Status | Hype |
| --- | --- | --- |
| Guided Flows for Generative Modeling and Decision Making | | 0 |
| Cross-speaker Emotion Transfer by Manipulating Speech Style Latents | | 0 |
| A multilingual training strategy for low resource Text to Speech | | 0 |
| A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models | | 0 |
| Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model | | 0 |
| Accent Conversion in Text-To-Speech Using Multi-Level VAE and Adversarial Training | | 0 |
| Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation | | 0 |
| Cross-Lingual Text-to-Speech Using Multi-Task Learning and Speaker Classifier Joint Training | | 0 |
| AttentionStitch: How Attention Solves the Speech Editing Problem | | 0 |
| Handling Numeric Expressions in Automatic Speech Recognition | | 0 |
| GOAT-TTS: Expressive and Realistic Speech Generation via A Dual-Branch LLM | | 0 |
| Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario | | 0 |
| Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers | | 0 |
| Get Large Language Models Ready to Speak: A Late-fusion Approach for Speech Generation | | 0 |
| GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech | | 0 |
| Grapheme-Coherent Phonemic and Prosodic Annotation of Speech by Implicit and Explicit Grapheme Conditioning | | 0 |
| An Investigation of the Relation Between Grapheme Embeddings and Pronunciation for Tacotron-based Systems | | 0 |
| Grapheme-to-Phoneme Transformer Model for Transfer Learning Dialects | | 0 |
| GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis | | 0 |
| GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis | | 0 |
| GraphTTS: graph-to-sequence modelling in neural text-to-speech | | 0 |
| GRASS: Unified Generation Model for Speech-to-Semantic Tasks | | 0 |
| Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech | | 0 |
| Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework | | 0 |
| Cross-Domain Audio Deepfake Detection: Dataset and Analysis | | 0 |
| Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance | | 0 |
| A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach | | 0 |
| Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data | | 0 |
| Hand Sign to Bangla Speech: A Deep Learning in Vision based system for Recognizing Hand Sign Digits and Generating Bangla Speech | | 0 |
| Harder or Different? Understanding Generalization of Audio Deepfake Detection | | 0 |
| Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM | | 0 |
| Hear Your Code Fail, Voice-Assisted Debugging for Python | | 0 |
| Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech | | 0 |
| Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech | | 0 |
| Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech | | 0 |
| Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis | | 0 |
| Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control | | 0 |
| Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis | | 0 |
| Hierarchical Representation of Prosody for Statistical Speech Synthesis | | 0 |
| Hierarchical Sequence to Sequence Voice Conversion with Limited Data | | 0 |
| Generative Semantic Communication for Text-to-Speech Synthesis | | 0 |
| Generative Pre-training for Speech with Flow Matching | | 0 |
| HiFiTTS-2: A Large-Scale High Bandwidth Speech Dataset | | 0 |
| Cross-Dialect Text-To-Speech in Pitch-Accent Language Incorporating Multi-Dialect Phoneme-Level BERT | | 0 |
| Audio Deep Fake Detection System with Neural Stitching for ADD 2022 | | 0 |
| High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models | | 0 |
| Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement | | 0 |
| Highly Effective Arabic Diacritization using Sequence to Sequence Modeling | | 0 |
| High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units | | 0 |
| Creating New Voices using Normalizing Flows | | 0 |

No leaderboard results yet.