SOTAVerified

Text to Speech

from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Check gtts.lang.tts_langs() to confirm "ku" is available in the installed gTTS version.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the Kurdish text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")
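
The snippet above opens the saved file with `start`, which exists only on Windows. A minimal sketch of a portable alternative follows, using only the standard library. The function names `player_command` and `open_audio` are hypothetical, and the Linux branch assumes a desktop where `xdg-open` is installed.

```python
import subprocess
import sys


def player_command(path, platform=sys.platform):
    """Return the argv list that opens `path` with the default application.

    Hypothetical helper for illustration; not part of gTTS.
    """
    if platform.startswith("win"):
        # `start` is a cmd.exe built-in; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        # macOS ships the `open` command.
        return ["open", path]
    # Assumption: a freedesktop-style Linux desktop with xdg-open installed.
    return ["xdg-open", path]


def open_audio(path):
    """Launch the default player for `path` without building a shell string."""
    subprocess.run(player_command(path), check=False)
```

Passing the path as a separate list element, rather than interpolating it into a shell string, also avoids problems with file names that contain spaces.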

Papers

Showing 101-125 of 1419 papers

Title | Status | Hype
Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment | Code | 2
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism | Code | 2
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT | Code | 2
TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument | Code | 2
Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis | Code | 2
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation | Code | 2
PAM: Prompting Audio-Language Models for Audio Quality Assessment | Code | 2
ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus | Code | 1
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech | Code | 1
In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data | Code | 1
Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion | Code | 1
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems | Code | 1
KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset | Code | 1
End-to-End Adversarial Text-to-Speech | Code | 1
Improving TTS for Shanghainese: Addressing Tone Sandhi via Word Segmentation | Code | 1
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning | Code | 1
End to End Lip Synchronization with a Temporal AutoEncoder | Code | 1
Improving fairness for spoken language understanding in atypical speech with Text-to-Speech | Code | 1
End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition Model | Code | 1
EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to Speech | Code | 1
ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features | Code | 1
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet | Code | 1
Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding | Code | 1
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech | Code | 1
Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech | Code | 1
Page 5 of 57

No leaderboard results yet.