VoiceMixer: Adversarial Voice Style Mixup Dec 1, 2021 Disentanglement Representation Learning
— Unverified 0CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer Nov 30, 2021 Voice Conversion
Code Code Available 1One-shot Voice Conversion For Style Transfer Based On Speaker Adaptation Nov 24, 2021 Style Transfer Voice Conversion
— Unverified 0AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion Nov 12, 2021 Voice Conversion
— Unverified 0SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines Nov 6, 2021 Disentanglement Speaker Verification
Code Code Available 0Voice Conversion Can Improve ASR in Very Low-Resource Settings Nov 4, 2021 Data Augmentation speech-recognition
— Unverified 0A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion Nov 3, 2021 Representation Learning Voice Conversion
Code Code Available 1Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations Oct 27, 2021 Voice Conversion
Code Code Available 1Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning Oct 27, 2021 Disentanglement Representation Learning
— Unverified 0Controllable and Interpretable Singing Voice Decomposition via Assem-VC Oct 25, 2021 Voice Conversion
Code Code Available 1Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion Oct 20, 2021 Disentanglement Voice Conversion
— Unverified 0Speech Enhancement-assisted Voice Conversion in Noisy Environments Oct 19, 2021 Speech Enhancement Voice Conversion
— Unverified 0CycleFlow: Purify Information Factors by Cycle Loss Oct 18, 2021 Voice Conversion
— Unverified 0FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection Oct 18, 2021 Speech Synthesis Synthetic Speech Detection
Code Code Available 1LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech Oct 18, 2021 Voice Conversion
Code Code Available 1Towards Identity Preserving Normal to Dysarthric Voice Conversion Oct 15, 2021 Data Augmentation Decision Making
— Unverified 0SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing Oct 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Toward Degradation-Robust Voice Conversion Oct 14, 2021 Denoising Speech Enhancement
Code Code Available 1Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models Oct 13, 2021 Resynthesis Speaker anonymization
— Unverified 0DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding Oct 13, 2021 Speech Synthesis Voice Conversion
— Unverified 0S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations Oct 12, 2021 Benchmarking Voice Conversion
Code Code Available 1Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding Oct 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MediumVC: Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features Oct 6, 2021 Voice Conversion
Code Code Available 1Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks Oct 4, 2021 Decoder Voice Conversion
Code Code Available 0Incorporating speaker embedding and post-filter network for improving speaker similarity of personalized speech synthesis system Oct 1, 2021 Speaker Verification Speech Synthesis
— Unverified 0ClsVC: Learning Speech Representations with two different classification tasks. Sep 29, 2021 Classification Vocal Bursts Valence Prediction
— Unverified 0Adaptive Speech Duration Modification using a Deep-Generative Framework Sep 29, 2021 Decoder Dynamic Time Warping
— Unverified 0Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme Sep 28, 2021 Speech Synthesis Voice Conversion
Code Code Available 1Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration Sep 12, 2021 Decoder text-to-speech
Code Code Available 1Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion Sep 8, 2021 Dynamic Time Warping Speech Enhancement
— Unverified 0Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection Sep 1, 2021 Speaker Verification Speech Synthesis
— Unverified 0RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform Aug 12, 2021 Speaker Verification Synthetic Speech Detection
— Unverified 0StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition Aug 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations Jul 26, 2021 Voice Conversion
— Unverified 0UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021 Jul 26, 2021 Audio Compression Face Swapping
Code Code Available 1StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion Jul 21, 2021 Generative Adversarial Network text-to-speech
Code Code Available 1SVSNet: An End-to-end Speaker Voice Similarity Assessment Model Jul 20, 2021 Voice Conversion Voice Similarity
Code Code Available 0On Prosody Modeling for ASR+TTS based Voice Conversion Jul 20, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation Jul 18, 2021 Data Augmentation Emotion Recognition
Code Code Available 0Many-to-Many Voice Conversion based Feature Disentanglement using Variational Autoencoder Jul 11, 2021 Disentanglement Voice Conversion
— Unverified 0A Deep-Bayesian Framework for Adaptive Speech Duration Modification Jul 11, 2021 Decoder Dynamic Time Warping
— Unverified 0Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer Jul 8, 2021 Emotion Recognition Speech Emotion Recognition
— Unverified 0An Objective Evaluation Framework for Pathological Speech Synthesis Jul 1, 2021 Speech Synthesis Voice Conversion
— Unverified 0VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion Jun 18, 2021 Disentanglement Quantization
Code Code Available 1Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments Jun 16, 2021 Decoder Voice Conversion
Code Code Available 0Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion Jun 16, 2021 Style Transfer Voice Conversion
— Unverified 0Pathological voice adaptation with autoencoder-based voice conversion Jun 15, 2021 Speech Synthesis Voice Conversion
— Unverified 0A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion Jun 2, 2021 Voice Conversion
— Unverified 0NVC-Net: End-to-End Adversarial Voice Conversion Jun 2, 2021 GPU Speech Synthesis
Code Code Available 0Learning Paralinguistic Features from Audiobooks through Style Voice Conversion Jun 1, 2021 Emotion Recognition Style Detection
— Unverified 0