SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 726750 of 1419 papers

TitleStatusHype
Parallel Attention Forcing for Machine Translation0
Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech0
Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages0
Investigating Content-Aware Neural Text-To-Speech MOS Prediction Using Prosodic and Linguistic Features0
Generating Multilingual Gender-Ambiguous Text-to-Speech Voices0
Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers0
Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection0
Structured State Space Decoder for Speech Recognition and Synthesis0
Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation0
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier TransformCode2
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis0
Residual Adapters for Few-Shot Text-to-Speech Speaker Adaptation0
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders0
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech SynthesisCode1
Explicit Intensity Control for Accented Text-to-speech0
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech0
Improving Speech-to-Speech Translation Through Unlabeled Text0
Semi-Supervised Learning Based on Reference Model for Low-resource TTS0
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data0
Efficiently Trained Low-Resource Mongolian Text-to-Speech System Based On FullConv-TTS0
HiFi-WaveGAN: Generative Adversarial Network with Auxiliary Spectrogram-Phase Loss for High-Fidelity Singing Voice GenerationCode1
Low-Resource Multilingual and Zero-Shot Multispeaker TTS0
Adaptive re-calibration of channel-wise features for Adversarial Audio Classification0
Towards Relation Extraction From SpeechCode1
Generating Synthetic Speech from SpokenVocab for Speech TranslationCode0
Show:102550
← PrevPage 30 of 57Next →

No leaderboard results yet.