Multi-Speaker End-to-End Speech Synthesis Jul 9, 2019 Speech Synthesis
— Unverified 0\'Evaluation objective de plongements pour la synth\`ese de parole guid\'ee par r\'eseaux de neurones (Objective evaluation of embeddings for speech synthesis guided by neural networks) Jul 1, 2019 Speech Synthesis
— Unverified 0Deep Residual Neural Networks for Audio Spoofing Detection Jun 30, 2019 Speaker Verification Speech Synthesis
Code Code Available 0End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training Jun 26, 2019 Emotional Speech Synthesis Emotion Recognition
— Unverified 0RUSLAN: Russian Spoken Language Corpus for Speech Synthesis Jun 26, 2019 Speech Synthesis text-to-speech
— Unverified 0A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation Jun 18, 2019 Decoder Speech Synthesis
— Unverified 0Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models Jun 17, 2019 Decoder Speech Synthesis
— Unverified 0Using generative modelling to produce varied intonation for speech synthesis Jun 10, 2019 Sentence Speech Synthesis
Code Code Available 0Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis Jun 8, 2019 Expressive Speech Synthesis Speech Synthesis
Code Code Available 1MelNet: A Generative Model for Audio in the Frequency Domain Jun 4, 2019 Audio Generation Music Generation
Code Code Available 0Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain Jun 3, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Neural Models of Text Normalization for Speech Applications Jun 1, 2019 BIG-bench Machine Learning Speech Synthesis
— Unverified 0Permanent Magnetic Articulograph (PMA) vs Electromagnetic Articulograph (EMA) in Articulation-to-Speech Synthesis for Silent Speech Interface Jun 1, 2019 Speech Synthesis
— Unverified 0Neural Text Normalization with Subword Units Jun 1, 2019 Machine Translation Natural Language Understanding
— Unverified 0Speaker Anonymization Using X-vector and Neural Waveform Models May 30, 2019 Speaker anonymization Speaker Verification
— Unverified 0Video-to-Video Translation for Visual Speech Synthesis May 28, 2019 Image-to-Image Translation Speech Synthesis
— Unverified 0ET-GAN: Cross-Language Emotion Transfer Based on Cycle-Consistent Generative Adversarial Networks May 27, 2019 Domain Adaptation Generative Adversarial Network
— Unverified 0FastSpeech: Fast, Robust and Controllable Text to Speech May 22, 2019 Decoder Speech Synthesis
Code Code Available 2Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems May 21, 2019 parameter estimation Speech Synthesis
Code Code Available 0CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network May 17, 2019 Decoder Sentence
— Unverified 0Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models May 15, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Synthetic-Neuroscore: Using A Neuro-AI Interface for Evaluating Generative Adversarial Networks May 10, 2019 Image Generation Speech Synthesis
Code Code Available 1Neural source-filter waveform models for statistical parametric speech synthesis Apr 27, 2019 Speech Synthesis
— Unverified 0Latent Variable Algorithms for Multimodal Learning and Sensor Fusion Apr 23, 2019 Activity Recognition Decision Making
— Unverified 0Spoof detection using time-delay shallow neural network and feature switching Apr 16, 2019 Speaker Verification Speech Synthesis
Code Code Available 0Unsupervised acoustic unit discovery for speech synthesis using discrete latent-variable neural networks Apr 16, 2019 Acoustic Unit Discovery Decoder
— Unverified 0Direct speech-to-speech translation with a sequence-to-sequence model Apr 12, 2019 Speech Synthesis Speech-to-Speech Translation
Code Code Available 0A high quality and phonetic balanced speech corpus for Vietnamese Apr 11, 2019 Speech Synthesis Vocal Bursts Intensity Prediction
— Unverified 0STC Antispoofing Systems for the ASVspoof2019 Challenge Apr 11, 2019 Speech Synthesis Voice Conversion
Code Code Available 0RawNet: Fast End-to-End Neural Vocoder Apr 10, 2019 Speech Synthesis
Code Code Available 0SPEAK YOUR MIND! Towards Imagined Speech Recognition With Hierarchical Deep Learning Apr 8, 2019 Brain Computer Interface General Classification
— Unverified 0Deep Learning the EEG Manifold for Phonological Categorization from Active Thoughts Apr 8, 2019 Binary Classification Deep Learning
— Unverified 0GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogram Apr 8, 2019 Speech Synthesis text-to-speech
Code Code Available 0WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation Apr 5, 2019 Speech Synthesis
— Unverified 0Multi-reference Tacotron by Intercross Training for Style Disentangling,Transfer and Control in Speech Synthesis Apr 4, 2019 Diversity Speech Synthesis
— Unverified 0In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data Apr 4, 2019 Speech Synthesis text-to-speech
Code Code Available 1Speech denoising by parametric resynthesis Apr 2, 2019 Denoising Resynthesis
— Unverified 0Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet Mar 29, 2019 Decoder Speech Synthesis
— Unverified 0A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet Mar 28, 2019 Speech Synthesis
Code Code Available 2Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis through Audio Analysis Mar 27, 2019 Emotional Speech Synthesis Expressive Speech Synthesis
Code Code Available 1Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis Mar 14, 2019 Generative Adversarial Network Speech Synthesis
— Unverified 0Deep Text-to-Speech System with Seq2Seq Model Mar 11, 2019 model Speech Synthesis
— Unverified 0The Virtual Doctor: An Interactive Artificial Intelligence based on Deep Learning for Non-Invasive Prediction of Diabetes Mar 9, 2019 Prognosis speech-recognition
— Unverified 0Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks Feb 18, 2019 Speech Synthesis Voice Cloning
— Unverified 0Exploring Transfer Learning for Low Resource Emotional TTS Jan 14, 2019 Deep Learning Emotional Speech Synthesis
Code Code Available 1Learning latent representations for style control and transfer in end-to-end speech synthesis Dec 11, 2018 Speech Synthesis Style Transfer
Code Code Available 0Learning pronunciation from a foreign language in speech synthesis networks Nov 23, 2018 Speech Synthesis
Code Code Available 1Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes Nov 22, 2018 All speech-recognition
— Unverified 0Effect of data reduction on sequence-to-sequence neural TTS Nov 15, 2018 Speech Synthesis
— Unverified 0AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms Nov 9, 2018 GPU Image Captioning
— Unverified 0