Text to Speech

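from gtts import gTTS
import os


def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (the language code "ku" selects Kurdish).
    # Note: by default gTTS rejects language codes it does not support, so
    # confirm that "ku" appears in gtts.lang.tts_langs() before relying on it.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (Windows only)


# Example (the Kurdish text reads: "Hello, this is my voice in the Kurdish language."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")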
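
The snippet above plays the saved file with the Windows-only `start` command. A minimal sketch of a cross-platform alternative follows, using only the standard library. The helper names `build_open_command` and `open_audio` are hypothetical, and the sketch assumes `open` is available on macOS and `xdg-open` on Linux desktops.

```python
import platform
import subprocess


def build_open_command(path, system=None):
    """Return the argument list that opens `path` with the default application.

    Hypothetical helper: `system` defaults to platform.system() and can be
    passed explicitly to inspect the command for another platform.
    """
    system = system or platform.system()
    if system == "Windows":
        # `start` is a cmd.exe builtin; the empty string is the window title.
        return ["cmd", "/c", "start", "", path]
    if system == "Darwin":
        return ["open", path]
    # Assumption: any other system is a Linux desktop that provides xdg-open.
    return ["xdg-open", path]


def open_audio(path):
    """Open the audio file with the platform's default player."""
    subprocess.run(build_open_command(path), check=False)
```

Inside `text_to_speech_kurdish`, the `os.system(...)` line could then be swapped for `open_audio(output_file)`. Passing an argument list to `subprocess.run` also avoids shell-quoting problems when the file name contains spaces.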

Papers


Showing 1201–1250 of 1419 papers

Title	Date	Tasks	Status
Exploring TTS without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020)	May 11, 2020	Clustering, speech-recognition	Code Available
Luganda Text-to-Speech Machine	May 11, 2020	text-to-speech, Text to Speech	Code Available
IndicSpeech: Text-to-Speech Corpus for Indian Languages	May 1, 2020	text-to-speech, Text to Speech	Unverified
Corpus Generation for Voice Command in Smart Home and the Effect of Speech Synthesis on End-to-End SLU	May 1, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Neural Text-to-Speech Synthesis for an Under-Resourced Language in a Diglossic Environment: the Case of Gascon Occitan	May 1, 2020	Speech Synthesis, text-to-speech	Unverified
Crowdsourcing Latin American Spanish for Low-Resource Text-to-Speech	May 1, 2020	text-to-speech, Text to Speech	Unverified
Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech	May 1, 2020	Text Normalization, text-to-speech	Unverified
Open-source Multi-speaker Speech Corpora for Building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu Speech Synthesis Systems	May 1, 2020	Speech Synthesis, text-to-speech	Unverified
Development and Evaluation of Speech Synthesis Corpora for Latvian	May 1, 2020	speech-recognition, Speech Recognition	Unverified
Open-Source High Quality Speech Datasets for Basque, Catalan and Galician	May 1, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Style Variation as a Vantage Point for Code-Switching	May 1, 2020	Language Modeling, Language Modelling	Unverified
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech	Apr 30, 2020	Rhythm, text-to-speech	Unverified
A Study of Non-autoregressive Model for Sequence Generation	Apr 22, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
ESPnet-ST: All-in-One Speech Translation Toolkit	Apr 21, 2020	All, Automatic Speech Recognition	Unverified
Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System	Apr 20, 2020	text-to-speech, Text to Speech	Unverified
Scalable Multilingual Frontend for TTS	Apr 10, 2020	Chunking, Machine Translation	Unverified
Generating Multilingual Voices Using Speaker Space Translation Based on Bilingual Speaker Data	Apr 10, 2020	text-to-speech, Text to Speech	Unverified
Improving Readability for Automatic Speech Recognition Transcription	Apr 9, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Statistical Context-Dependent Units Boundary Correction for Corpus-based Unit-Selection Text-to-Speech	Mar 5, 2020	Segmentation, text-to-speech	Unverified
GraphTTS: graph-to-sequence modelling in neural text-to-speech	Mar 4, 2020	Graph Embedding, Graph-to-Sequence	Unverified
AlignTTS: Efficient Feed-Forward Text-to-Speech System without Explicit Alignment	Mar 4, 2020	text-to-speech, Text to Speech	Code Available
Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis	Feb 28, 2020	Speech Synthesis, text-to-speech	Code Available
On the Discrepancy between Density Estimation and Sequence Generation	Feb 17, 2020	Density Estimation, Machine Translation	Unverified
Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis	Feb 6, 2020	Disentanglement, Speech Synthesis	Unverified
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior	Feb 6, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
BOFFIN TTS: Few-Shot Speaker Adaptation by Bayesian Optimization	Feb 4, 2020	Bayesian Optimization, text-to-speech	Unverified
WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss	Feb 2, 2020	text-to-speech, Text to Speech	Unverified
Improving LPCNet-based Text-to-Speech with Linear Prediction-structured Mixture Density Network	Jan 31, 2020	Quantization, Speech Synthesis	Unverified
From Speech-to-Speech Translation to Automatic Dubbing	Jan 19, 2020	Machine Translation, Speech-to-Speech Translation	Unverified
Smart Summarizer for Blind People	Jan 1, 2020	text-to-speech, Text to Speech	Unverified
Parallel Neural Text-to-Speech	Jan 1, 2020	text-to-speech, Text to Speech	Unverified
Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems	Dec 19, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Singing Synthesis: with a little help from my attention	Dec 12, 2019	text-to-speech, Text to Speech	Unverified
Neural Voice Puppetry: Audio-driven Facial Reenactment	Dec 11, 2019	Face Model, Neural Rendering	Code Available
Semantic Mask for Transformer based End-to-End Speech Recognition	Dec 6, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available
Towards Robust Neural Vocoding for Speech Generation: A Survey	Dec 5, 2019	Speech Synthesis, Survey	Unverified
Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection	Dec 2, 2019	Speech Synthesis, text-to-speech	Unverified
Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech	Nov 28, 2019	Disentanglement, Expressive Speech Synthesis	Unverified
Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers	Nov 26, 2019	Speech Synthesis, text-to-speech	Unverified
Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features	Nov 21, 2019	text-to-speech, Text to Speech	Unverified
Independent and automatic evaluation of acoustic-to-articulatory inversion models	Nov 15, 2019	speech-recognition, Speech Recognition	Code Available
A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesis	Nov 11, 2019	Polyphone disambiguation, Speech Synthesis	Unverified
Emotional Voice Conversion using Multitask Learning with Text-to-speech	Nov 11, 2019	Decoder, text-to-speech	Code Available
Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework	Nov 7, 2019	Sentence, Speech Synthesis	Unverified
Teacher-Student Training for Robust Tacotron-based TTS	Nov 7, 2019	Decoder, Knowledge Distillation	Unverified
A System for Diacritizing Four Varieties of Arabic	Nov 1, 2019	Feature Engineering, text-to-speech	Unverified
Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis	Oct 29, 2019	Speaker Verification, Speech Synthesis	Code Available
Unsupervised pre-training for sequence to sequence speech recognition	Oct 28, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment	Oct 28, 2019	Hard Attention, Speech Synthesis	Unverified
Multi-Reference Neural TTS Stylization with Adversarial Cycle Consistency	Oct 25, 2019	Emotion Classification, Style Transfer	Unverified

