Non-autoregressive sequence-to-sequence voice conversion Apr 14, 2021 text-to-speech Text to Speech
— Unverified 0NoiseVC: Towards High Quality Zero-Shot Voice Conversion Apr 13, 2021 Disentanglement Quantization
— Unverified 0Utilizing Self-supervised Representations for MOS Prediction Apr 7, 2021 Prediction Voice Conversion
Code Code Available 0StarGAN-based Emotional Voice Conversion for Japanese Phrases Apr 5, 2021 Voice Conversion
— Unverified 0Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning Mar 17, 2021 Decoder Representation Learning
— Unverified 0Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech Mar 6, 2021 text-to-speech Text to Speech
— Unverified 0Axial Residual Networks for CycleGAN-based Voice Conversion Feb 16, 2021 Voice Conversion
— Unverified 0ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech Feb 11, 2021 Speaker Verification Speech Synthesis
— Unverified 0Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram Feb 3, 2021 text-to-speech Text to Speech
— Unverified 0High Fidelity Speech Regeneration with Application to Speech Enhancement Jan 31, 2021 Denoising Speaker Separation
— Unverified 0Adversarially learning disentangled speech representations for robust multi-factor voice conversion Jan 30, 2021 Representation Learning Rhythm
— Unverified 0Hierarchical disentangled representation learning for singing voice conversion Jan 18, 2021 Representation Learning Voice Conversion
— Unverified 0EmoCat: Language-agnostic Emotional Voice Conversion Jan 14, 2021 Decoder text-to-speech
— Unverified 0Joint Audio-Visual Deepfake Detection Jan 1, 2021 DeepFake Detection Face Swapping
— Unverified 0Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation Dec 8, 2020 Attribute Disentanglement
Code Code Available 0How Far Are We from Robust Voice Conversion: A Survey Nov 24, 2020 Speaker Identification Survey
— Unverified 0Low-resource expressive text-to-speech using data augmentation Nov 11, 2020 Data Augmentation text-to-speech
— Unverified 0Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion Nov 3, 2020 speech-recognition Speech Recognition
— Unverified 0VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech Nov 3, 2020 Decoder Disentanglement
— Unverified 0Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech Nov 2, 2020 Knowledge Distillation Speech Synthesis
— Unverified 0PPG-based singing voice conversion with adversarial representation learning Oct 28, 2020 Representation Learning Voice Conversion
— Unverified 0Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations Oct 23, 2020 Voice Conversion
— Unverified 0Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion Oct 16, 2020 Speech Synthesis text-to-speech
— Unverified 0The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders Oct 9, 2020 Task 2 Voice Conversion
— Unverified 0Latent linguistic embedding for cross-lingual text-to-speech and voice conversion Oct 8, 2020 text-to-speech Text to Speech
— Unverified 0FastVC: Fast Voice Conversion with non-parallel data Oct 8, 2020 Voice Conversion
— Unverified 0The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS Oct 6, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0The Academia Sinica Systems of Voice Conversion for VCC2020 Oct 6, 2020 Task 2 Voice Conversion
— Unverified 0When Automatic Voice Disguise Meets Automatic Speaker Verification Sep 15, 2020 Miscellaneous Speaker Verification
— Unverified 0Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer Sep 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion Aug 28, 2020 Voice Conversion
— Unverified 0DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices Aug 15, 2020 Speaker Recognition Voice Conversion
— Unverified 0Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN Aug 11, 2020 Voice Conversion
— Unverified 0VAW-GAN for Singing Voice Conversion with Non-parallel Training Data Aug 10, 2020 Decoder Generative Adversarial Network
— Unverified 0Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling Aug 9, 2020 Deep Learning Speech Synthesis
— Unverified 0Unsupervised Cross-Domain Singing Voice Conversion Aug 6, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Neural MOS Prediction for Synthesized Speech Using Multi-Task Learning With Spoofing Detection and Spoofing Type Classification Jul 16, 2020 Multi-Task Learning Prediction
— Unverified 0Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised Learning Jun 5, 2020 Self-Supervised Learning Speaker Verification
Code Code Available 0NAUTILUS: a Versatile Voice Cloning System May 22, 2020 Speech Synthesis text-to-speech
— Unverified 0Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Many-to-Many Voice Transformer Network May 18, 2020 Voice Conversion
— Unverified 0Vowels and Prosody Contribution in Neural Network Based Voice Conversion Algorithm with Noisy Training Data Mar 10, 2020 Voice Conversion
— Unverified 0Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis Feb 28, 2020 Speech Synthesis text-to-speech
Code Code Available 0Many-to-Many Voice Conversion using Conditional Cycle-Consistent Adversarial Networks Feb 15, 2020 Generative Adversarial Network Voice Conversion
— Unverified 0Vocoder-free End-to-End Voice Conversion with Transformer Network Feb 5, 2020 speech-recognition Speech Recognition
Code Code Available 0Mel-spectrogram augmentation for sequence to sequence voice conversion Jan 6, 2020 Voice Conversion
Code Code Available 0Learning Singing From Speech Dec 20, 2019 Speech Synthesis Voice Conversion
— Unverified 0Voice Conversion for Whispered Speech Synthesis Dec 11, 2019 Speech Synthesis Voice Conversion
— Unverified 0Towards Robust Neural Vocoding for Speech Generation: A Survey Dec 5, 2019 Speech Synthesis Survey
— Unverified 0PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network Dec 4, 2019 Decoder Music Generation
— Unverified 0