A unified one-shot prosody and speaker conversion system with self-supervised discrete speech units (Nov 12, 2022). Tags: Rhythm, Voice Conversion
GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models (Oct 11, 2022). Tags: Disentanglement, Generative Adversarial Network
Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental analysis of generalizability, open challenges, and the way forward (Oct 2, 2022). Tags: Misinformation, Speaker Verification
ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Speed (Sep 23, 2022). Tags: Pitch control, Speech Synthesis
DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion (Sep 9, 2022). Tags: De-identification, Speaker Verification
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Aug 18, 2022). Tags: Disentanglement, Rhythm
A Comparative Study of Self-supervised Speech Representation Based Voice Conversion (Jul 10, 2022). Tags: Voice Conversion
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning (Jun 15, 2022). Tags: Attribute, Emotion Classification
Speak Like a Dog: Human to Non-human creature Voice Conversion (Jun 9, 2022). Tags: Generative Adversarial Network, Voice Conversion
End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions (May 19, 2022). Tags: Speech Synthesis, Style Transfer
Towards Improved Zero-shot Voice Conversion with Conditional DSVAE (May 11, 2022). Tags: Voice Conversion
Efficient Non-Autoregressive GAN Voice Conversion using VQWav2vec Features and Dynamic Convolution (Mar 31, 2022). Tags: Voice Conversion
HiFi-VC: High Quality ASR-Based Voice Conversion (Mar 31, 2022). Tags: speech-recognition, Speech Recognition
Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice Conversion (Mar 30, 2022). Tags: Data Augmentation, Decoder
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion (Mar 29, 2022). Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR)
SpeechSplit 2.0: Unsupervised speech disentanglement for voice conversion Without tuning autoencoder Bottlenecks (Mar 26, 2022). Tags: Disentanglement, Rhythm
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph (Feb 24, 2022). Tags: Decoder, Quantization
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone (Dec 4, 2021). Tags: Speech Synthesis, Text-To-Speech Synthesis
CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer (Nov 30, 2021). Tags: Voice Conversion
A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion (Nov 3, 2021). Tags: Representation Learning, Voice Conversion
Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations (Oct 27, 2021). Tags: Voice Conversion
Controllable and Interpretable Singing Voice Decomposition via Assem-VC (Oct 25, 2021). Tags: Voice Conversion
LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech (Oct 18, 2021). Tags: Voice Conversion
FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection (Oct 18, 2021). Tags: Speech Synthesis, Synthetic Speech Detection
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing (Oct 14, 2021). Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Toward Degradation-Robust Voice Conversion (Oct 14, 2021). Tags: Denoising, Speech Enhancement
S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations (Oct 12, 2021). Tags: Benchmarking, Voice Conversion
MediumVC: Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features (Oct 6, 2021). Tags: Voice Conversion
Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme (Sep 28, 2021). Tags: Speech Synthesis, Voice Conversion
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration (Sep 12, 2021). Tags: Decoder, text-to-speech
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021 (Jul 26, 2021). Tags: Audio Compression, Face Swapping
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion (Jul 21, 2021). Tags: Generative Adversarial Network, text-to-speech
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion (Jun 18, 2021). Tags: Disentanglement, Quantization
Emotional Voice Conversion: Theory, Databases and ESD (May 31, 2021). Tags: Voice Conversion
Low-Latency Real-Time Non-Parallel Voice Conversion based on Cyclic Variational Autoencoder and Multiband WaveRNN with Data-Driven Linear Prediction (May 20, 2021). Tags: CPU, Voice Conversion
Deep Learning Based Assessment of Synthetic Speech Naturalness (Apr 23, 2021). Tags: Deep Learning, Prediction
Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss (Apr 22, 2021). Tags: Voice Cloning, Voice Conversion
S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations (Apr 7, 2021). Tags: Self-Supervised Learning, Voice Conversion
Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques (Apr 2, 2021). Tags: Decoder, Rhythm
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations (Apr 1, 2021). Tags: Disentanglement, Representation Learning
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training (Mar 31, 2021). Tags: text-to-speech, Text to Speech
crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder (Mar 4, 2021). Tags: Voice Conversion
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames (Feb 25, 2021). Tags: Voice Conversion
Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training (Dec 3, 2020). Tags: Audio Generation, Disentanglement
FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation (Nov 11, 2020). Tags: Voice Conversion
Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset (Oct 28, 2020). Tags: Decoder, Emotion Recognition
One-class learning towards generalized voice spoofing detection (Oct 27, 2020). Tags: Speaker Verification, text-to-speech
FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention (Oct 27, 2020). Tags: Disentanglement, Speaker Verification
Voice Conversion Using Speech-to-Speech Neuro-Style Transfer (Oct 25, 2020). Tags: Generative Adversarial Network, Style Transfer
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion (Oct 22, 2020). Tags: Voice Conversion