StarGAN-ZSVC: Towards Zero-Shot Voice Conversion in Low-Resource Contexts May 31, 2021 Voice Conversion
— Unverified 0Emotional Voice Conversion: Theory, Databases and ESD May 31, 2021 Voice Conversion
Code Code Available 1DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion May 28, 2021 Denoising Voice Conversion
— Unverified 0Low-Latency Real-Time Non-Parallel Voice Conversion based on Cyclic Variational Autoencoder and Multiband WaveRNN with Data-Driven Linear Prediction May 20, 2021 CPU Voice Conversion
Code Code Available 1Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery May 4, 2021 Acoustic Unit Discovery Voice Conversion
— Unverified 0An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion Apr 25, 2021 Generative Adversarial Network Speech Synthesis
— Unverified 0Deep Learning Based Assessment of Synthetic Speech Naturalness Apr 23, 2021 Deep Learning Prediction
Code Code Available 1Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss Apr 22, 2021 Voice Cloning Voice Conversion
Code Code Available 1Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels Apr 15, 2021 Voice Conversion
— Unverified 0Non-autoregressive sequence-to-sequence voice conversion Apr 14, 2021 text-to-speech Text to Speech
— Unverified 0NoiseVC: Towards High Quality Zero-Shot Voice Conversion Apr 13, 2021 Disentanglement Quantization
— Unverified 0S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations Apr 7, 2021 Self-Supervised Learning Voice Conversion
Code Code Available 1Utilizing Self-supervised Representations for MOS Prediction Apr 7, 2021 Prediction Voice Conversion
Code Code Available 0StarGAN-based Emotional Voice Conversion for Japanese Phrases Apr 5, 2021 Voice Conversion
— Unverified 0Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques Apr 2, 2021 Decoder Rhythm
Code Code Available 1Speech Resynthesis from Discrete Disentangled Self-Supervised Representations Apr 1, 2021 Disentanglement Representation Learning
Code Code Available 1Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training Mar 31, 2021 text-to-speech Text to Speech
Code Code Available 1Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning Mar 17, 2021 Decoder Representation Learning
— Unverified 0Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech Mar 6, 2021 text-to-speech Text to Speech
— Unverified 0crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder Mar 4, 2021 Voice Conversion
Code Code Available 1MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames Feb 25, 2021 Voice Conversion
Code Code Available 1Axial Residual Networks for CycleGAN-based Voice Conversion Feb 16, 2021 Voice Conversion
— Unverified 0ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech Feb 11, 2021 Speaker Verification Speech Synthesis
— Unverified 0Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram Feb 3, 2021 text-to-speech Text to Speech
— Unverified 0High Fidelity Speech Regeneration with Application to Speech Enhancement Jan 31, 2021 Denoising Speaker Separation
— Unverified 0Adversarially learning disentangled speech representations for robust multi-factor voice conversion Jan 30, 2021 Representation Learning Rhythm
— Unverified 0Hierarchical disentangled representation learning for singing voice conversion Jan 18, 2021 Representation Learning Voice Conversion
— Unverified 0EmoCat: Language-agnostic Emotional Voice Conversion Jan 14, 2021 Decoder text-to-speech
— Unverified 0Joint Audio-Visual Deepfake Detection Jan 1, 2021 DeepFake Detection Face Swapping
— Unverified 0Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation Dec 8, 2020 Attribute Disentanglement
Code Code Available 0Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training Dec 3, 2020 Audio Generation Disentanglement
Code Code Available 1How Far Are We from Robust Voice Conversion: A Survey Nov 24, 2020 Speaker Identification Survey
— Unverified 0Low-resource expressive text-to-speech using data augmentation Nov 11, 2020 Data Augmentation text-to-speech
— Unverified 0FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation Nov 11, 2020 Voice Conversion
Code Code Available 1Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion Nov 3, 2020 speech-recognition Speech Recognition
— Unverified 0VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech Nov 3, 2020 Decoder Disentanglement
— Unverified 0Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech Nov 2, 2020 Knowledge Distillation Speech Synthesis
— Unverified 0Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset Oct 28, 2020 Decoder Emotion Recognition
Code Code Available 1PPG-based singing voice conversion with adversarial representation learning Oct 28, 2020 Representation Learning Voice Conversion
— Unverified 0One-class learning towards generalized voice spoofing detection Oct 27, 2020 Speaker Verification text-to-speech
Code Code Available 1FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention Oct 27, 2020 Disentanglement Speaker Verification
Code Code Available 1Voice Conversion Using Speech-to-Speech Neuro-Style Transfer Oct 25, 2020 Generative Adversarial Network Style Transfer
Code Code Available 1Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations Oct 23, 2020 Voice Conversion
— Unverified 0CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion Oct 22, 2020 Voice Conversion
Code Code Available 1Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion Oct 16, 2020 Speech Synthesis text-to-speech
— Unverified 0Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN Oct 9, 2020 Generative Adversarial Network Task 2
Code Code Available 1The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders Oct 9, 2020 Task 2 Voice Conversion
— Unverified 0FastVC: Fast Voice Conversion with non-parallel data Oct 8, 2020 Voice Conversion
— Unverified 0Latent linguistic embedding for cross-lingual text-to-speech and voice conversion Oct 8, 2020 text-to-speech Text to Speech
— Unverified 0The Academia Sinica Systems of Voice Conversion for VCC2020 Oct 6, 2020 Task 2 Voice Conversion
— Unverified 0