Exploring the Potential of Lexical Paraphrases for Mitigating Noise-Induced Comprehension Errors Jul 18, 2021 Speech Synthesis
— Unverified 0Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging Jul 12, 2021 Prediction Speech Synthesis
Code Code Available 0Location, Location: Enhancing the Evaluation of Text-to-Speech Synthesis Using the Rapid Prosody Transcription Paradigm Jul 6, 2021 Speech Synthesis text-to-speech
— Unverified 0Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input Jul 5, 2021 Speech Synthesis text-to-speech
Code Code Available 0An Objective Evaluation Framework for Pathological Speech Synthesis Jul 1, 2021 Speech Synthesis Voice Conversion
— Unverified 0A Survey on Neural Speech Synthesis Jun 29, 2021 Speech Synthesis Survey
Code Code Available 1FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis Jun 29, 2021 Speech Synthesis text-to-speech
Code Code Available 1GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis Jun 29, 2021 Speech Synthesis text-to-speech
— Unverified 0Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance Jun 25, 2021 Quantization Speaker anonymization
— Unverified 0Distilling the Knowledge from Conditional Normalizing Flows Jun 24, 2021 Image Super-Resolution Speech Synthesis
Code Code Available 0Controllable Context-aware Conversational Speech Synthesis Jun 21, 2021 Speech Synthesis
— Unverified 0Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis Jun 21, 2021 Speech Synthesis
— Unverified 0Non-native English lexicon creation for bilingual speech synthesis Jun 21, 2021 Speech Synthesis text-to-speech
— Unverified 0UniTTS: Residual Learning of Unified Embedding Space for Speech Style Control Jun 21, 2021 Expressive Speech Synthesis Speech Synthesis
— Unverified 0Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters Jun 19, 2021 Speech Synthesis text-to-speech
— Unverified 0EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model Jun 17, 2021 Emotional Speech Synthesis Emotion Classification
— Unverified 0WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis Jun 17, 2021 Speech Synthesis text-to-speech
Code Code Available 1A Flow-Based Neural Network for Time Domain Speech Enhancement Jun 16, 2021 Density Estimation Speech Enhancement
— Unverified 0Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis Jun 15, 2021 Speech Synthesis text-to-speech
— Unverified 0RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis Jun 15, 2021 speech-recognition Speech Recognition
Code Code Available 1Pathological voice adaptation with autoencoder-based voice conversion Jun 15, 2021 Speech Synthesis Voice Conversion
— Unverified 0UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation Jun 15, 2021 Speech Synthesis text-to-speech
Code Code Available 3Continuous Wavelet Vocoder-based Decomposition of Parametric Speech Waveform Synthesis Jun 12, 2021 Speech Synthesis
— Unverified 0PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior Jun 11, 2021 Audio Generation Denoising
— Unverified 0Sprachsynthese -- State-of-the-Art in englischer und deutscher Sprache Jun 11, 2021 Speech Synthesis
— Unverified 0Enhancing Speaking Styles in Conversational Text-to-Speech Synthesis with Graph-based Multi-modal Context Modeling Jun 11, 2021 Speech Synthesis text-to-speech
Code Code Available 1Learning to Efficiently Sample from Diffusion Probabilistic Models Jun 7, 2021 Denoising Speech Synthesis
— Unverified 0Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis Jun 6, 2021 CPU GPU
— Unverified 0Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis Jun 3, 2021 Data Augmentation Speaker Verification
— Unverified 0An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis Jun 3, 2021 Speaker Verification Speech Synthesis
— Unverified 0RAD-TTS: Parallel Flow-Based TTS with Robust Alignment Learning and Diverse Synthesis Jun 2, 2021 Diversity Rhythm
Code Code Available 1Dual Script E2E framework for Multilingual and Code-Switching ASR Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0NVC-Net: End-to-End Adversarial Voice Conversion Jun 2, 2021 GPU Speech Synthesis
Code Code Available 0Byakto Speech: Real-time long speech synthesis with convolutional neural network: Transfer learning from English to Bangla May 31, 2021 Deep Learning speech-recognition
Code Code Available 1A Corpus of Neutral Voice Speech in Brazilian Portuguese May 21, 2021 Speech Synthesis text-to-speech
— Unverified 0Speaker disentanglement in video-to-speech conversion May 20, 2021 Disentanglement Speech Synthesis
Code Code Available 0Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech May 13, 2021 Decoder Speech Synthesis
Code Code Available 1Learning Robust Latent Representations for Controllable Speech Synthesis May 10, 2021 Speech Synthesis text-to-speech
— Unverified 0Towards a practical lip-to-speech conversion system using deep neural networks and mobile application frontend Apr 29, 2021 Lip to Speech Synthesis Speech Synthesis
— Unverified 0End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks Apr 27, 2021 Lip Reading Speech Synthesis
— Unverified 0Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis Apr 26, 2021 Language Modeling Language Modelling
Code Code Available 0An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion Apr 25, 2021 Generative Adversarial Network Speech Synthesis
— Unverified 0Deep Learning Based Assessment of Synthetic Speech Naturalness Apr 23, 2021 Deep Learning Prediction
Code Code Available 1Review of end-to-end speech synthesis technology based on deep learning Apr 20, 2021 Speech Synthesis
— Unverified 0KazakhTTS: An Open-Source Kazakh Text-to-Speech Synthesis Dataset Apr 17, 2021 Speech Synthesis text-to-speech
Code Code Available 1TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction Apr 16, 2021 Speech Synthesis text-to-speech
Code Code Available 1Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis Apr 14, 2021 Dependency Parsing Representation Learning
— Unverified 0Half-Truth: A Partially Fake Audio Detection Dataset Apr 8, 2021 Speech Synthesis
Code Code Available 0Flavored Tacotron: Conditional Learning for Prosodic-linguistic Features Apr 8, 2021 Decoder Speech Synthesis
— Unverified 0Towards Multi-Scale Style Control for Expressive Speech Synthesis Apr 8, 2021 Expressive Speech Synthesis Speech Synthesis
— Unverified 0