Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1151–1200 of 1419 papers

Title	Date	Tasks	Status
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework	Nov 4, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Training Wake Word Detection with Synthesized Speech Data on Confusion Words	Nov 3, 2020	Data AugmentationKeyword Spotting	—Unverified
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech	Nov 2, 2020	Knowledge DistillationSpeech Synthesis	—Unverified
Learning from Explanations and Demonstrations: A Pilot Study	Nov 1, 2020	text-to-speechText to Speech	—Unverified
DeviceTTS: A Small-Footprint, Fast, Stable Network for On-Device Text-to-Speech	Oct 29, 2020	Decodertext-to-speech	—Unverified
Effective Decoder Masking for Transformer Based End-to-End Speech Recognition	Oct 27, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators	Oct 27, 2020	text-to-speechText to Speech	—Unverified
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition	Oct 26, 2020	Emotion RecognitionSpeech Emotion Recognition	—Unverified
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis	Oct 23, 2020	Graph AttentionGraph Neural Network	—Unverified
The NTU-AISG Text-to-speech System for Blizzard Challenge 2020	Oct 22, 2020	text-to-speechText to Speech	—Unverified
NU-GAN: High resolution neural upsampling with GAN	Oct 22, 2020	Audio GenerationSpeech Synthesis	—Unverified
Learning Speaker Embedding from Text-to-Speech	Oct 21, 2020	ClassificationDecoder	CodeCode Available
A Mask-based Model for Mandarin Chinese Polyphone Disambiguation	Oct 21, 2020	Polyphone disambiguationtext-to-speech	—Unverified
An Investigation of the Relation Between Grapheme Embeddings and Pronunciation for Tacotron-based Systems	Oct 21, 2020	Grapheme-to-Phoneme ConversionRelation	—Unverified
Replacing Human Audio with Synthetic Audio for On-device Unspoken Punctuation Prediction	Oct 20, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE	Oct 19, 2020	Speech Synthesistext-to-speech	—Unverified
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion	Oct 16, 2020	Speech Synthesistext-to-speech	—Unverified
Improving Low Resource Code-switched ASR using Augmented Code-switched TTS	Oct 12, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Latent linguistic embedding for cross-lingual text-to-speech and voice conversion	Oct 8, 2020	text-to-speechText to Speech	—Unverified
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems	Oct 8, 2020	Data Augmentationintent-classification	—Unverified
Neural Speech Synthesis for Estonian	Oct 6, 2020	SentenceSpeech Synthesis	—Unverified
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS	Oct 6, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
JSSS: free Japanese speech corpus for summarization and simplification	Oct 5, 2020	FormSpeech Synthesis	CodeCode Available
Compress Polyphone Pronunciation Prediction Model with Shared Labels	Oct 1, 2020	PredictionQuantization	—Unverified
Automatic Arabic Dialect Identification Systems for Written Texts: A Survey	Sep 26, 2020	Dialect IdentificationMachine Translation	—Unverified
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis	Sep 17, 2020	Expressive Speech SynthesisSpeech Synthesis	—Unverified
Controllable neural text-to-speech synthesis using intuitive prosodic features	Sep 14, 2020	SentenceSpeech Synthesis	—Unverified
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS	Sep 4, 2020	DecoderSentence	—Unverified
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer	Sep 3, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Textual Echo Cancellation	Aug 13, 2020	Acoustic echo cancellationspeech-recognition	—Unverified
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages	Aug 11, 2020	Quantizationtext-to-speech	—Unverified
Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems	Aug 11, 2020	text-to-speechText to Speech	—Unverified
LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition	Aug 9, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes	Aug 7, 2020	Gaussian ProcessesSpeech Synthesis	—Unverified
Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning	Aug 7, 2020	Audio Generationreinforcement-learning	—Unverified
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability	Jul 30, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A Transfer Learning End-to-End ArabicText-To-Speech (TTS) Deep Architecture	Jul 22, 2020	RhythmSpeech Synthesis	—Unverified
Normalizing Text using Language Modelling based on Phonetics and String Similarity	Jun 25, 2020	Language ModelingLanguage Modelling	—Unverified
Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework	Jun 12, 2020	text-to-speechText to Speech	—Unverified
Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised Learning	Jun 5, 2020	Self-Supervised LearningSpeaker Verification	CodeCode Available
NAUTILUS: a Versatile Voice Cloning System	May 22, 2020	Speech Synthesistext-to-speech	—Unverified
Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario	May 21, 2020	AttributeSpeech Synthesis	—Unverified
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis	May 20, 2020	Speech Synthesistext-to-speech	—Unverified
Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech	May 19, 2020	text-to-speechText to Speech	—Unverified
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders	May 18, 2020	text-to-speechText to Speech	—Unverified
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation	May 16, 2020	DecoderSpeech Synthesis	—Unverified
JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment	May 15, 2020	text-to-speechText to Speech	—Unverified
You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation	May 14, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN	May 12, 2020	Few-Shot Learningtext-to-speech	—Unverified
DiscreTalk: Text-to-Speech as a Machine Translation Problem	May 12, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified

Show:10 25 50

← PrevPage 24 of 29Next →

No leaderboard results yet.