Text to Speech

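from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # Note: gTTS raises ValueError if the installed version does not list "ku" as supported.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    # Open the audio file with the default player (Windows only).
    os.system(f'start "" "{output_file}"')

# Example (the text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")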
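The snippet above opens the saved file with the Windows-only `start` command. A minimal sketch of a cross-platform alternative follows, using only the standard library. The helper names `player_command` and `open_audio` are hypothetical, and the Linux branch assumes a desktop that provides `xdg-open`.

```python
import os
import platform
import subprocess


def player_command(path, system=None):
    """Return the argv list that opens `path` with the default app.

    Returns None on Windows, where os.startfile is used instead.
    """
    system = system or platform.system()
    if system == "Windows":
        return None  # handled by os.startfile, no subprocess needed
    if system == "Darwin":
        return ["open", path]  # macOS
    return ["xdg-open", path]  # assumption: freedesktop-style Linux desktop


def open_audio(path):
    """Open the audio file with the platform's default player."""
    command = player_command(path)
    if command is None:
        os.startfile(path)  # available on Windows only
    else:
        # Passing a list avoids shell quoting problems with spaces in paths.
        subprocess.run(command, check=False)
```

Calling `open_audio("output.mp3")` after `tts.save("output.mp3")` would then replace the `os.system` line.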

Papers


Showing 1251–1300 of 1419 papers

Title	Date	Tasks	Status
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit	Oct 24, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available
Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis	Oct 23, 2019	Form, Speech Synthesis	Code Available
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR	Oct 22, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach	Oct 14, 2019	Expressive Speech Synthesis, Sociology	Unverified
Semi-Supervised Generative Modeling for Controllable Speech Synthesis	Oct 3, 2019	Speech Synthesis, text-to-speech	Unverified
High Fidelity Speech Synthesis with Adversarial Networks	Sep 25, 2019	Generative Adversarial Network, Speech Synthesis	Code Available
Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech	Sep 14, 2019	text-to-speech, Text to Speech	Unverified
A Comparative Study on Transformer vs RNN in Speech Applications	Sep 13, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available
Modular Meta-Learning with Shrinkage	Sep 12, 2019	Image Classification, Meta-Learning	Unverified
Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs	Sep 9, 2019	Form, Speech Synthesis	Unverified
Neural Network-Based Modeling of Phonetic Durations	Sep 6, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
A Large-Scale User Study of an Alexa Prize Chatbot: Effect of TTS Dynamism on Perceived Quality of Social Dialog	Sep 1, 2019	Chatbot, text-to-speech	Unverified
Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments	Aug 30, 2019	Decoder, text-to-speech	Unverified
Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis	Aug 27, 2019	Speech Synthesis, text-to-speech	Unverified
From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories	Aug 20, 2019	Retrieval, TAG	Unverified
Numbers Normalisation in the Inflected Languages: a Case Study of Polish	Aug 1, 2019	text-to-speech, Text to Speech	Code Available
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible	Jul 30, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available
Hierarchical Sequence to Sequence Voice Conversion with Limited Data	Jul 15, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
M3D-GAN: Multi-Modal Multi-Domain Translation with Universal Attention	Jul 9, 2019	Dialogue Generation, Image Captioning	Unverified
A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech -- a Deep Learning approach	Jul 5, 2019	text-to-speech, Text to Speech	Unverified
A Novel Approach to OCR using Image Recognition based Classification for Ancient Tamil Inscriptions in Temples	Jul 4, 2019	Binarization, General Classification	Unverified
Fine-grained robust prosody transfer for single-speaker neural text-to-speech	Jul 4, 2019	text-to-speech, Text to Speech	Unverified
Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features	Jul 3, 2019	Polyphone disambiguation, Sentence	Unverified
Improving Performance of End-to-End ASR on Numeric Sequences	Jul 1, 2019	speech-recognition, Speech Recognition	Unverified
An adaptable task-oriented dialog system for stand-alone embedded devices	Jul 1, 2019	Dialogue Management, Management	Unverified
RUSLAN: Russian Spoken Language Corpus for Speech Synthesis	Jun 26, 2019	Speech Synthesis, text-to-speech	Unverified
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling	Jun 17, 2019	Representation Learning, Speech Representation Learning	Unverified
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models	Jun 17, 2019	Decoder, Speech Synthesis	Unverified
Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise	Jun 13, 2019	Data Augmentation, Decoder	Unverified
Using generative modelling to produce varied intonation for speech synthesis	Jun 10, 2019	Sentence, Speech Synthesis	Code Available
Non-Differentiable Supervised Learning with Evolution Strategies and Hybrid Methods	Jun 7, 2019	text-to-speech, Text to Speech	Unverified
MelNet: A Generative Model for Audio in the Frequency Domain	Jun 4, 2019	Audio Generation, Music Generation	Code Available
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain	Jun 3, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Customizing Grapheme-to-Phoneme System for Non-Trivial Transcription Problems in Bangla Language	Jun 1, 2019	speech-recognition, Speech Recognition	Unverified
Neural Text Normalization with Subword Units	Jun 1, 2019	Machine Translation, Natural Language Understanding	Unverified
Neural Models of Text Normalization for Speech Applications	Jun 1, 2019	BIG-bench Machine Learning, Speech Synthesis	Unverified
Highly Effective Arabic Diacritization using Sequence to Sequence Modeling	Jun 1, 2019	Feature Engineering, Machine Translation	Unverified
A Cost Efficient Approach to Correct OCR Errors in Large Document Collections	May 28, 2019	Clustering, Language Modelling	Unverified
Non-Autoregressive Neural Text-to-Speech	May 21, 2019	text-to-speech, Text to Speech	Code Available
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems	May 21, 2019	parameter estimation, Speech Synthesis	Code Available
CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network	May 17, 2019	Decoder, Sentence	Unverified
Almost Unsupervised Text to Speech and Automatic Speech Recognition	May 13, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text	Apr 30, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
The Zero Resource Speech Challenge 2019: TTS without T	Apr 25, 2019	text-to-speech, Text to Speech	Unverified
Expediting TTS Synthesis with Adversarial Vocoding	Apr 16, 2019	text-to-speech, Text to Speech	Code Available
End-to-end Text-to-speech for Low-resource Languages by Cross-Lingual Transfer Learning	Apr 13, 2019	Cross-Lingual Transfer, text-to-speech	Unverified
Direct speech-to-speech translation with a sequence-to-sequence model	Apr 12, 2019	Speech Synthesis, Speech-to-Speech Translation	Code Available
Building a mixed-lingual neural TTS system with only monolingual data	Apr 12, 2019	Decoder, text-to-speech	Unverified
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogram	Apr 8, 2019	Speech Synthesis, text-to-speech	Code Available
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion	Apr 6, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified

