Prosody-TTS: An end-to-end speech synthesis system with prosody control Oct 6, 2021 Rhythm Speech Synthesis
— Unverified 0GANtron: Emotional Speech Synthesis with Generative Adversarial Networks Oct 6, 2021 Emotional Speech Synthesis Speech Synthesis
— Unverified 0On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis Oct 4, 2021 Knowledge Distillation Speech Synthesis
— Unverified 0Neural Speech Synthesis in German Oct 3, 2021 Speech Synthesis text-to-speech
— Unverified 0Incorporating speaker embedding and post-filter network for improving speaker similarity of personalized speech synthesis system Oct 1, 2021 Speaker Verification Speech Synthesis
— Unverified 0Speech-MLP: a simple MLP architecture for speech processing Sep 29, 2021 Keyword Spotting Speech Enhancement
— Unverified 0Conditioning Sequence-to-sequence Networks with Learned Activations Sep 29, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SynCLR: A Synthesis Framework for Contrastive Learning of out-of-domain Speech Representations Sep 29, 2021 Contrastive Learning Data Augmentation
— Unverified 0Guided-TTS:Text-to-Speech with Untranscribed Speech Sep 29, 2021 Speech Synthesis text-to-speech
— Unverified 0FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis Sep 27, 2021 Density Estimation Speech Synthesis
— Unverified 0Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network Sep 22, 2021 Knowledge Distillation Language Modeling
— Unverified 0"Hello, It's Me": Deep Learning-based Speech Synthesis Attacks in the Real World Sep 20, 2021 Deep Learning Speaker Recognition
— Unverified 0On-device neural speech synthesis Sep 17, 2021 GPU Speech Synthesis
— Unverified 0fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit Sep 14, 2021 Speech Synthesis text-to-speech
Code Code Available 0Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis Sep 8, 2021 Expressive Speech Synthesis Sentence
— Unverified 0Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection Sep 1, 2021 Speaker Verification Speech Synthesis
— Unverified 0Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism Aug 31, 2021 Speech Synthesis
— Unverified 0Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement Aug 27, 2021 Audio Signal Processing Speech Enhancement
— Unverified 0Integrated Speech and Gesture Synthesis Aug 25, 2021 Speech Synthesis text-to-speech
Code Code Available 0A Unified Transformer-based Framework for Duplex Text Normalization Aug 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing audio quality for expressive Neural Text-to-Speech Aug 13, 2021 Acoustic Modelling Speech Synthesis
— Unverified 0A Streamwise GAN Vocoder for Wideband Speech Coding at Very Low Bit Rate Aug 9, 2021 Speech Synthesis
— Unverified 0Improved pronunciation prediction accuracy using morphology Aug 1, 2021 LEMMA Morphological Inflection
— Unverified 0Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis Jul 27, 2021 Expressive Speech Synthesis Speech Synthesis
— Unverified 0Exploring the Potential of Lexical Paraphrases for Mitigating Noise-Induced Comprehension Errors Jul 18, 2021 Speech Synthesis
— Unverified 0Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging Jul 12, 2021 Prediction Speech Synthesis
Code Code Available 0Location, Location: Enhancing the Evaluation of Text-to-Speech Synthesis Using the Rapid Prosody Transcription Paradigm Jul 6, 2021 Speech Synthesis text-to-speech
— Unverified 0Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input Jul 5, 2021 Speech Synthesis text-to-speech
Code Code Available 0An Objective Evaluation Framework for Pathological Speech Synthesis Jul 1, 2021 Speech Synthesis Voice Conversion
— Unverified 0GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis Jun 29, 2021 Speech Synthesis text-to-speech
— Unverified 0Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance Jun 25, 2021 Quantization Speaker anonymization
— Unverified 0Distilling the Knowledge from Conditional Normalizing Flows Jun 24, 2021 Image Super-Resolution Speech Synthesis
Code Code Available 0UniTTS: Residual Learning of Unified Embedding Space for Speech Style Control Jun 21, 2021 Expressive Speech Synthesis Speech Synthesis
— Unverified 0Controllable Context-aware Conversational Speech Synthesis Jun 21, 2021 Speech Synthesis
— Unverified 0Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis Jun 21, 2021 Speech Synthesis
— Unverified 0Non-native English lexicon creation for bilingual speech synthesis Jun 21, 2021 Speech Synthesis text-to-speech
— Unverified 0Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters Jun 19, 2021 Speech Synthesis text-to-speech
— Unverified 0EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model Jun 17, 2021 Emotional Speech Synthesis Emotion Classification
— Unverified 0A Flow-Based Neural Network for Time Domain Speech Enhancement Jun 16, 2021 Density Estimation Speech Enhancement
— Unverified 0Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis Jun 15, 2021 Speech Synthesis text-to-speech
— Unverified 0Pathological voice adaptation with autoencoder-based voice conversion Jun 15, 2021 Speech Synthesis Voice Conversion
— Unverified 0Continuous Wavelet Vocoder-based Decomposition of Parametric Speech Waveform Synthesis Jun 12, 2021 Speech Synthesis
— Unverified 0Sprachsynthese -- State-of-the-Art in englischer und deutscher Sprache Jun 11, 2021 Speech Synthesis
— Unverified 0PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior Jun 11, 2021 Audio Generation Denoising
— Unverified 0Learning to Efficiently Sample from Diffusion Probabilistic Models Jun 7, 2021 Denoising Speech Synthesis
— Unverified 0Mathematical Vocoder Algorithm : Modified Spectral Inversion for Efficient Neural Speech Synthesis Jun 6, 2021 CPU GPU
— Unverified 0Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis Jun 3, 2021 Data Augmentation Speaker Verification
— Unverified 0An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis Jun 3, 2021 Speaker Verification Speech Synthesis
— Unverified 0NVC-Net: End-to-End Adversarial Voice Conversion Jun 2, 2021 GPU Speech Synthesis
Code Code Available 0Dual Script E2E framework for Multilingual and Code-Switching ASR Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0