Cross-speaker style transfer for text-to-speech using data augmentation Feb 10, 2022 Data Augmentation Style Transfer
— Unverified 0Invertible Voice Conversion Jan 26, 2022 Voice Conversion
— Unverified 0The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition Jan 13, 2022 Generative Adversarial Network Phoneme Recognition
— Unverified 0Emotion Intensity and its Control for Emotional Voice Conversion Jan 10, 2022 Emotion Classification Voice Conversion
— Unverified 0A Practical Guide to Logical Access Voice Presentation Attack Detection Jan 10, 2022 Artifact Detection Speaker Verification
Code Code Available 0Adversarial Transformation of Spoofing Attacks for Voice Biometrics Jan 4, 2022 Speaker Verification Voice Conversion
— Unverified 0IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion Jan 2, 2022 Quantization Voice Conversion
— Unverified 0The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices Dec 15, 2021 Speaker Identification Voice Conversion
— Unverified 0Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features Dec 8, 2021 Decoder Self-Supervised Learning
— Unverified 0Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion Dec 6, 2021 Decoder Voice Conversion
— Unverified 0VoiceMixer: Adversarial Voice Style Mixup Dec 1, 2021 Disentanglement Representation Learning
— Unverified 0One-shot Voice Conversion For Style Transfer Based On Speaker Adaptation Nov 24, 2021 Style Transfer Voice Conversion
— Unverified 0AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion Nov 12, 2021 Voice Conversion
— Unverified 0SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines Nov 6, 2021 Disentanglement Speaker Verification
Code Code Available 0Voice Conversion Can Improve ASR in Very Low-Resource Settings Nov 4, 2021 Data Augmentation speech-recognition
— Unverified 0Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning Oct 27, 2021 Disentanglement Representation Learning
— Unverified 0Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion Oct 20, 2021 Disentanglement Voice Conversion
— Unverified 0Speech Enhancement-assisted Voice Conversion in Noisy Environments Oct 19, 2021 Speech Enhancement Voice Conversion
— Unverified 0CycleFlow: Purify Information Factors by Cycle Loss Oct 18, 2021 Voice Conversion
— Unverified 0Towards Identity Preserving Normal to Dysarthric Voice Conversion Oct 15, 2021 Data Augmentation Decision Making
— Unverified 0Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models Oct 13, 2021 Resynthesis Speaker anonymization
— Unverified 0DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding Oct 13, 2021 Speech Synthesis Voice Conversion
— Unverified 0Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding Oct 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks Oct 4, 2021 Decoder Voice Conversion
Code Code Available 0Incorporating speaker embedding and post-filter network for improving speaker similarity of personalized speech synthesis system Oct 1, 2021 Speaker Verification Speech Synthesis
— Unverified 0Adaptive Speech Duration Modification using a Deep-Generative Framework Sep 29, 2021 Decoder Dynamic Time Warping
— Unverified 0ClsVC: Learning Speech Representations with two different classification tasks. Sep 29, 2021 Classification Vocal Bursts Valence Prediction
— Unverified 0Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion Sep 8, 2021 Dynamic Time Warping Speech Enhancement
— Unverified 0Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection Sep 1, 2021 Speaker Verification Speech Synthesis
— Unverified 0RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform Aug 12, 2021 Speaker Verification Synthetic Speech Detection
— Unverified 0StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition Aug 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations Jul 26, 2021 Voice Conversion
— Unverified 0SVSNet: An End-to-end Speaker Voice Similarity Assessment Model Jul 20, 2021 Voice Conversion Voice Similarity
Code Code Available 0On Prosody Modeling for ASR+TTS based Voice Conversion Jul 20, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation Jul 18, 2021 Data Augmentation Emotion Recognition
Code Code Available 0Many-to-Many Voice Conversion based Feature Disentanglement using Variational Autoencoder Jul 11, 2021 Disentanglement Voice Conversion
— Unverified 0A Deep-Bayesian Framework for Adaptive Speech Duration Modification Jul 11, 2021 Decoder Dynamic Time Warping
— Unverified 0Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer Jul 8, 2021 Emotion Recognition Speech Emotion Recognition
— Unverified 0An Objective Evaluation Framework for Pathological Speech Synthesis Jul 1, 2021 Speech Synthesis Voice Conversion
— Unverified 0Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments Jun 16, 2021 Decoder Voice Conversion
Code Code Available 0Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion Jun 16, 2021 Style Transfer Voice Conversion
— Unverified 0Pathological voice adaptation with autoencoder-based voice conversion Jun 15, 2021 Speech Synthesis Voice Conversion
— Unverified 0A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion Jun 2, 2021 Voice Conversion
— Unverified 0NVC-Net: End-to-End Adversarial Voice Conversion Jun 2, 2021 GPU Speech Synthesis
Code Code Available 0Learning Paralinguistic Features from Audiobooks through Style Voice Conversion Jun 1, 2021 Emotion Recognition Style Detection
— Unverified 0StarGAN-ZSVC: Towards Zero-Shot Voice Conversion in Low-Resource Contexts May 31, 2021 Voice Conversion
— Unverified 0DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion May 28, 2021 Denoising Voice Conversion
— Unverified 0Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery May 4, 2021 Acoustic Unit Discovery Voice Conversion
— Unverified 0An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion Apr 25, 2021 Generative Adversarial Network Speech Synthesis
— Unverified 0Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels Apr 15, 2021 Voice Conversion
— Unverified 0