SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 13011350 of 1419 papers

TitleStatusHype
Speech denoising by parametric resynthesis0
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora0
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworksCode0
Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet0
CSS10: A Collection of Single Speaker Speech Datasets for 10 LanguagesCode0
Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis0
Deep Text-to-Speech System with Seq2Seq Model0
Data Efficient Voice Cloning for Neural Singing Synthesis0
Unsupervised Polyglot Text To Speech0
Hand Sign to Bangla Speech: A Deep Learning in Vision based system for Recognizing Hand Sign Digits and Generating Bangla Speech0
Feature reinforcement with word embedding and parsing information in neural TTS0
FPETS : Fully Parallel End-to-End Text-to-Speech SystemCode0
Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder0
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms0
Speaker-adaptive neural vocoders for parametric speech synthesis systems0
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation0
Cycle-consistency training for end-to-end speech recognition0
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator0
Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks0
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization0
Neural source-filter-based waveform model for statistical parametric speech synthesis0
Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention0
Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent languageCode0
A Deep Generative Acoustic Model for Compositional Automatic Speech Recognition0
A Fully Time-domain Neural Model for Subband-based Speech Synthesizer0
Hierarchical Generative Modeling for Controllable Speech SynthesisCode0
Diacritization of Maghrebi Arabic Sub-Dialects0
A Fully Time-domain Neural Model for Subband-based Speech SynthesizerCode0
台語古詩朗誦系統A Taiwanese Text-to-Speech System for Ancient Poems[In Chinese]0
A Challenge Set and Methods for Noun-Verb Ambiguity0
Sample Efficient Adaptive Text-to-Speech0
Self-Attention Linguistic-Acoustic Decoder0
Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis0
Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis0
Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder0
Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model0
Low-Resource Machine Transliteration Using Recurrent Neural Networks of Asian Languages0
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech SynthesisCode0
Voice Imitating Text-to-Speech Neural Networks0
Voice Builder: A Tool for Building Text-To-Speech Voices0
Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech0
Speaker-independent raw waveform model for glottal excitation0
Machine Speech Chain with One-shot Speaker Adaptation0
Speech to text and text to speech recognition systems-Areview0
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data0
Deep Feed-forward Sequential Memory Networks for Speech Synthesis0
Fitting New Speakers Based on a Short Untranscribed Sample0
Tools and resources for Romanian text-to-speech and speech-to-text applicationsCode0
An Implementation of Back-Propagation Learning on GF11, a Large SIMD Parallel Computer0
HybridNet: A Hybrid Neural Architecture to Speed-up Autoregressive Models0
Show:102550
← PrevPage 27 of 29Next →

No leaderboard results yet.