Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input Feb 19, 2021 Language Modeling Language Modelling
— Unverified 0VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention Feb 12, 2021 Speech Synthesis text-to-speech
— Unverified 0Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech
— Unverified 0Triple M: A Practical Text-to-speech Synthesis System With Multi-guidance Attention And Multi-band Multi-time LPCNet Jan 30, 2021 CPU Sentence
— Unverified 0Parallel WaveNet conditioned on VAE latent vectors Dec 17, 2020 Sentence Speech Synthesis
— Unverified 0Using previous acoustic context to improve Text-to-Speech synthesis Dec 7, 2020 Decoder Speech Synthesis
— Unverified 0Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph Entities Dec 1, 2020 Chinese Word Segmentation Speech Synthesis
Code Code Available 1Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Nov 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement Nov 8, 2020 Disentanglement Speech Synthesis
— Unverified 0Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis Nov 6, 2020 Decoder Speech Synthesis
Code Code Available 1Semi-supervised URL Segmentation with Recurrent Neural NetworksPre-trained on Knowledge Graph Entities Nov 5, 2020 Chinese Word Segmentation Speech Synthesis
Code Code Available 1Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework Nov 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time Nov 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Effective Deep Learning Models for Automatic Diacritization of Arabic Text Nov 1, 2020 Arabic Text Diacritization Decoder
Code Code Available 1GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis Oct 23, 2020 Graph Attention Graph Neural Network
— Unverified 0An Investigation of the Relation Between Grapheme Embeddings and Pronunciation for Tacotron-based Systems Oct 21, 2020 Grapheme-to-Phoneme Conversion Relation
— Unverified 0End-to-End Text-to-Speech using Latent Duration based on VQ-VAE Oct 19, 2020 Speech Synthesis text-to-speech
— Unverified 0MIA-Prognosis: A Deep Learning Framework to Predict Therapy Response Oct 8, 2020 Deep Learning Prognosis
Code Code Available 0Automatic Arabic Dialect Identification Systems for Written Texts: A Survey Sep 26, 2020 Dialect Identification Machine Translation
— Unverified 0Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis Sep 17, 2020 Expressive Speech Synthesis Speech Synthesis
— Unverified 0Controllable neural text-to-speech synthesis using intuitive prosodic features Sep 14, 2020 Sentence Speech Synthesis
— Unverified 0What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS Sep 4, 2020 Decoder Sentence
— Unverified 0Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer Sep 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0WaveGrad: Estimating Gradients for Waveform Generation Sep 2, 2020 Speech Synthesis Text-To-Speech Synthesis
Code Code Available 1Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion Aug 13, 2020 Speech Synthesis text-to-speech
Code Code Available 1Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes Aug 7, 2020 Gaussian Processes Speech Synthesis
— Unverified 0Normalizing Text using Language Modelling based on Phonetics and String Similarity Jun 25, 2020 Language Modeling Language Modelling
— Unverified 0FastSpeech 2: Fast and High-Quality End-to-End Text to Speech Jun 8, 2020 Knowledge Distillation Speech Synthesis
Code Code Available 1End-to-End Adversarial Text-to-Speech Jun 5, 2020 Adversarial Text Dynamic Time Warping
Code Code Available 1Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment Search May 22, 2020 text-to-speech Text to Speech
Code Code Available 1Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis May 20, 2020 Speech Synthesis text-to-speech
— Unverified 0Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation May 16, 2020 Decoder Speech Synthesis
— Unverified 0Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis May 12, 2020 Speech Synthesis Style Transfer
Code Code Available 1Neural Text-to-Speech Synthesis for an Under-Resourced Language in a Diglossic Environment: the Case of Gascon Occitan May 1, 2020 Speech Synthesis text-to-speech
— Unverified 0Style Variation as a Vantage Point for Code-Switching May 1, 2020 Language Modeling Language Modelling
— Unverified 0Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis Feb 28, 2020 Speech Synthesis text-to-speech
Code Code Available 0Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech Nov 28, 2019 Disentanglement Expressive Speech Synthesis
— Unverified 0Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers Nov 26, 2019 Speech Synthesis text-to-speech
— Unverified 0Independent and automatic evaluation of acoustic-to-articulatory inversion models Nov 15, 2019 speech-recognition Speech Recognition
Code Code Available 0A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesis Nov 11, 2019 Polyphone disambiguation Speech Synthesis
— Unverified 0Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework Nov 7, 2019 Sentence Speech Synthesis
— Unverified 0Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis Oct 29, 2019 Speaker Verification Speech Synthesis
Code Code Available 0Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment Oct 28, 2019 Hard Attention Speech Synthesis
— Unverified 0Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram Oct 25, 2019 Generative Adversarial Network GPU
Code Code Available 2The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach Oct 14, 2019 Expressive Speech Synthesis Sociology
— Unverified 0Modular Meta-Learning with Shrinkage Sep 12, 2019 Image Classification Meta-Learning
— Unverified 0Evaluating Long-form Text-to-Speech: Comparing the Ratings of Sentences and Paragraphs Sep 9, 2019 Form Speech Synthesis
— Unverified 0Neural Harmonic-plus-Noise Waveform Model with Trainable Maximum Voice Frequency for Text-to-Speech Synthesis Aug 27, 2019 Speech Synthesis text-to-speech
— Unverified 0MelNet: A Generative Model for Audio in the Frequency Domain Jun 4, 2019 Audio Generation Music Generation
Code Code Available 0Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain Jun 3, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0