Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis Oct 9, 2021 Lifelong learning Speech Synthesis
Code Code Available 0Environment Aware Text-to-Speech Synthesis Oct 8, 2021 Attribute Disentanglement
— Unverified 0Prosody-TTS: An end-to-end speech synthesis system with prosody control Oct 6, 2021 Rhythm Speech Synthesis
— Unverified 0Neural Speech Synthesis in German Oct 3, 2021 Speech Synthesis text-to-speech
— Unverified 0Guided-TTS:Text-to-Speech with Untranscribed Speech Sep 29, 2021 Speech Synthesis text-to-speech
— Unverified 0Conditioning Sequence-to-sequence Networks with Learned Activations Sep 29, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network Sep 22, 2021 Knowledge Distillation Language Modeling
— Unverified 0A Unified Transformer-based Framework for Duplex Text Normalization Aug 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging Jul 12, 2021 Prediction Speech Synthesis
Code Code Available 0Location, Location: Enhancing the Evaluation of Text-to-Speech Synthesis Using the Rapid Prosody Transcription Paradigm Jul 6, 2021 Speech Synthesis text-to-speech
— Unverified 0Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input Jul 5, 2021 Speech Synthesis text-to-speech
Code Code Available 0PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior Jun 11, 2021 Audio Generation Denoising
Code Code Available 0An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis Jun 3, 2021 Speaker Verification Speech Synthesis
— Unverified 0Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis Jun 3, 2021 Data Augmentation Speaker Verification
— Unverified 0Dual Script E2E framework for Multilingual and Code-Switching ASR Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis Apr 26, 2021 Language Modeling Language Modelling
Code Code Available 0Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis Apr 14, 2021 Dependency Parsing Representation Learning
— Unverified 0Flavored Tacotron: Conditional Learning for Prosodic-linguistic Features Apr 8, 2021 Decoder Speech Synthesis
— Unverified 0Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability Apr 3, 2021 Emotion Recognition reinforcement-learning
— Unverified 0PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS Mar 28, 2021 Representation Learning Text-To-Speech Synthesis
— Unverified 0Continual Speaker Adaptation for Text-to-Speech Synthesis Mar 26, 2021 Continual Learning Diversity
— Unverified 0Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input Feb 19, 2021 Language Modeling Language Modelling
— Unverified 0VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention Feb 12, 2021 Speech Synthesis text-to-speech
— Unverified 0Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech
— Unverified 0Triple M: A Practical Text-to-speech Synthesis System With Multi-guidance Attention And Multi-band Multi-time LPCNet Jan 30, 2021 CPU Sentence
— Unverified 0Parallel WaveNet conditioned on VAE latent vectors Dec 17, 2020 Sentence Speech Synthesis
— Unverified 0Using previous acoustic context to improve Text-to-Speech synthesis Dec 7, 2020 Decoder Speech Synthesis
— Unverified 0Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Nov 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement Nov 8, 2020 Disentanglement Speech Synthesis
— Unverified 0Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework Nov 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time Nov 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis Oct 23, 2020 Graph Attention Graph Neural Network
— Unverified 0An Investigation of the Relation Between Grapheme Embeddings and Pronunciation for Tacotron-based Systems Oct 21, 2020 Grapheme-to-Phoneme Conversion Relation
— Unverified 0End-to-End Text-to-Speech using Latent Duration based on VQ-VAE Oct 19, 2020 Speech Synthesis text-to-speech
— Unverified 0MIA-Prognosis: A Deep Learning Framework to Predict Therapy Response Oct 8, 2020 Deep Learning Prognosis
Code Code Available 0Automatic Arabic Dialect Identification Systems for Written Texts: A Survey Sep 26, 2020 Dialect Identification Machine Translation
— Unverified 0Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis Sep 17, 2020 Expressive Speech Synthesis Speech Synthesis
— Unverified 0Controllable neural text-to-speech synthesis using intuitive prosodic features Sep 14, 2020 Sentence Speech Synthesis
— Unverified 0What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS Sep 4, 2020 Decoder Sentence
— Unverified 0Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer Sep 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes Aug 7, 2020 Gaussian Processes Speech Synthesis
— Unverified 0Normalizing Text using Language Modelling based on Phonetics and String Similarity Jun 25, 2020 Language Modeling Language Modelling
— Unverified 0Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis May 20, 2020 Speech Synthesis text-to-speech
— Unverified 0Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation May 16, 2020 Decoder Speech Synthesis
— Unverified 0Style Variation as a Vantage Point for Code-Switching May 1, 2020 Language Modeling Language Modelling
— Unverified 0Neural Text-to-Speech Synthesis for an Under-Resourced Language in a Diglossic Environment: the Case of Gascon Occitan May 1, 2020 Speech Synthesis text-to-speech
— Unverified 0Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis Feb 28, 2020 Speech Synthesis text-to-speech
Code Code Available 0Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech Nov 28, 2019 Disentanglement Expressive Speech Synthesis
— Unverified 0Cross-lingual Multi-speaker Text-to-speech Synthesis for Voice Cloning without Using Parallel Corpus for Unseen Speakers Nov 26, 2019 Speech Synthesis text-to-speech
— Unverified 0Independent and automatic evaluation of acoustic-to-articulatory inversion models Nov 15, 2019 speech-recognition Speech Recognition
Code Code Available 0