SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 626650 of 1419 papers

TitleStatusHype
Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data AugmentationCode1
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found DataCode1
A unified front-end framework for English text-to-speech synthesis0
FastFit: Towards Real-Time Iterative Neural Vocoder by Replacing U-Net Encoder With Multiple STFTs0
Controllable Speaking Styles Using a Large Language Model0
Better speech synthesis through scalingCode6
CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency ModelCode2
Accented Text-to-Speech Synthesis with Limited Data0
Bts-e: Audio deepfake detection using breathing-talking-silence encoderCode1
M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis0
A Review of Deep Learning Techniques for Speech Processing0
Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech SynthesisCode2
Zero-shot text-to-speech synthesis conditioned using self-supervised speech representation model0
DiffVoice: Text-to-Speech with Latent Diffusion0
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained HubertCode4
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing SynthesizersCode2
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers0
Enhancing Speech-to-Speech Translation with Multiple TTS Targets0
An investigation of phrase break prediction in an End-to-End TTS systemCode0
ArmanTTS single-speaker Persian dataset0
Ensemble prosody prediction for expressive speech synthesis0
AraSpot: Arabic Spoken Command SpottingCode0
Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low Resource LanguagesCode1
Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis0
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis0
Show:102550
← PrevPage 26 of 57Next →

No leaderboard results yet.