SOTAVerified

Text to Speech

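from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Assumption carried over from the original: gTTS accepts "ku".
    # Verify with gtts.lang.tts_langs() before relying on it.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)

# Example (the string means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")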
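The snippet above opens the saved file with `start`, which exists only in the Windows command shell. A minimal sketch of a portable alternative follows. The helper names `open_command` and `play` are hypothetical and not part of gTTS, and the Linux branch assumes `xdg-open` is installed.

```python
import subprocess
import sys


def open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a desktop Linux/BSD system with xdg-utils available.
    return ["xdg-open", path]


def play(path):
    """Open an audio file with the system default player (hypothetical helper)."""
    subprocess.run(open_command(path), check=False)
```

Calling `play("output.mp3")` after `tts.save("output.mp3")` would replace the `os.system` line. Passing an argument list instead of a shell string also avoids quoting problems when the file name contains spaces.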

Papers

Showing 1301-1350 of 1419 papers

| Title | Status | Hype |
|---|---|---|
| Feature reinforcement with word embedding and parsing information in neural TTS | | 0 |
| FPETS : Fully Parallel End-to-End Text-to-Speech System | Code | 0 |
| Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder | | 0 |
| Robust universal neural vocoding | Code | 1 |
| AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms | | 0 |
| Speaker-adaptive neural vocoders for parametric speech synthesis systems | | 0 |
| Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation | | 0 |
| Cycle-consistency training for end-to-end speech recognition | | 0 |
| End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator | | 0 |
| Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization | | 0 |
| Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks | | 0 |
| Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language | Code | 0 |
| Neural source-filter-based waveform model for statistical parametric speech synthesis | | 0 |
| Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention | | 0 |
| LPCNet: Improving Neural Speech Synthesis Through Linear Prediction | Code | 2 |
| A Deep Generative Acoustic Model for Compositional Automatic Speech Recognition | | 0 |
| A Fully Time-domain Neural Model for Subband-based Speech Synthesizer | | 0 |
| Hierarchical Generative Modeling for Controllable Speech Synthesis | Code | 0 |
| Diacritization of Maghrebi Arabic Sub-Dialects | | 0 |
| A Fully Time-domain Neural Model for Subband-based Speech Synthesizer | Code | 0 |
| 台語古詩朗誦系統 (A Taiwanese Text-to-Speech System for Ancient Poems) [In Chinese] | | 0 |
| A Challenge Set and Methods for Noun-Verb Ambiguity | | 0 |
| Sample Efficient Adaptive Text-to-Speech | | 0 |
| Neural Speech Synthesis with Transformer Network | Code | 2 |
| Self-Attention Linguistic-Acoustic Decoder | | 0 |
| Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis | | 0 |
| Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis | | 0 |
| Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder | | 0 |
| Multi-scale Alignment and Contextual History for Attention Mechanism in Sequence-to-sequence Model | | 0 |
| ClariNet: Parallel Wave Generation in End-to-End Text-to-Speech | Code | 1 |
| Low-Resource Machine Transliteration Using Recurrent Neural Networks of Asian Languages | | 0 |
| Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis | Code | 0 |
| Voice Imitating Text-to-Speech Neural Networks | | 0 |
| Voice Builder: A Tool for Building Text-To-Speech Voices | | 0 |
| Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech | | 0 |
| Speaker-independent raw waveform model for glottal excitation | | 0 |
| Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text | Code | 1 |
| Machine Speech Chain with One-shot Speaker Adaptation | | 0 |
| Speech to text and text to speech recognition systems-A review | | 0 |
| Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data | | 0 |
| Deep Feed-forward Sequential Memory Networks for Speech Synthesis | | 0 |
| Efficient Neural Audio Synthesis | Code | 2 |
| Fitting New Speakers Based on a Short Untranscribed Sample | | 0 |
| Tools and resources for Romanian text-to-speech and speech-to-text applications | Code | 0 |
| An Implementation of Back-Propagation Learning on GF11, a Large SIMD Parallel Computer | | 0 |
| HybridNet: A Hybrid Neural Architecture to Speed-up Autoregressive Models | | 0 |
| Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform | | 0 |
| ObamaNet: Photo-realistic lip-sync from text | Code | 0 |
| Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention | Code | 1 |
| Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning | Code | 0 |

No leaderboard results yet.