SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 376400 of 1419 papers

TitleStatusHype
ECAPA-TDNN for Multi-speaker Text-to-speech SynthesisCode0
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systemsCode0
ClonEval: An Open Voice Cloning BenchmarkCode0
Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet VocoderCode0
FluentEditor2: Text-based Speech Editing by Modeling Multi-Scale Acoustic and Prosody ConsistencyCode0
Learning High-Frequency Functions Made Easy with Sinusoidal Positional EncodingCode0
Clip-TTS: Contrastive Text-content and Mel-spectrogram, A High-Quality Text-to-Speech Method based on Contextual Semantic Understanding0
ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus0
ArmanTTS single-speaker Persian dataset0
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech0
A Review of Multi-Modal Large Language and Vision Models0
A Human-in-the-Loop Approach to Improving Cross-Text Prosody Transfer0
CHULA TTS: A Modularized Text-To-Speech Framework0
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network0
A Review of Deep Learning Techniques for Speech Processing0
ChatAnything: Facetime Chat with LLM-Enhanced Personas0
Character-Level Bangla Text-to-IPA Transcription Using Transformer Architecture with Sequence Alignment0
A review-based study on different Text-to-Speech technologies0
A Generative Model of a Pronunciation Lexicon for Hindi0
A Cost Efficient Approach to Correct OCR Errors in Large Document Collections0
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models0
Chain-of-Thought Training for Open E2E Spoken Dialogue Systems0
CASSANDRA: A multipurpose configurable voice-enabled human-computer-interface0
Arabic Text-To-Speech (TTS) Data Preparation0
A Fully Time-domain Neural Model for Subband-based Speech Synthesizer0
Show:102550
← PrevPage 16 of 57Next →

No leaderboard results yet.