Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN Oct 9, 2020 Generative Adversarial Network Task 2
Code Code Available 1Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion Sep 30, 2020 Transfer Learning Voice Conversion
Code Code Available 1Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling Sep 6, 2020 feature selection speech-recognition
Code Code Available 1Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks Aug 27, 2020 Voice Conversion
Code Code Available 1CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion Aug 18, 2020 Prediction Voice Conversion
Code Code Available 1Pretraining Techniques for Sequence-to-Sequence Voice Conversion Aug 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1VQVC+: One-Shot Voice Conversion by Vector Quantization and U-Net architecture Jun 7, 2020 Disentanglement Quantization
Code Code Available 1Vector-quantized neural networks for acoustic unit discovery in the ZeroSpeech 2020 challenge May 19, 2020 Acoustic Unit Discovery Voice Conversion
Code Code Available 1Defending Your Voice: Adversarial Attack on Voice Conversion May 18, 2020 Adversarial Attack Voice Conversion
Code Code Available 1Robust Training of Vector Quantized Bottleneck Models May 18, 2020 Clustering Disentanglement
Code Code Available 1Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion May 13, 2020 Decoder Voice Conversion
Code Code Available 1F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder Apr 15, 2020 Style Transfer Voice Conversion
Code Code Available 1AraBERT: Transformer-based Model for Arabic Language Understanding Feb 28, 2020 model named-entity-recognition
Code Code Available 1Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data Feb 1, 2020 Voice Conversion
Code Code Available 1Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion Jan 22, 2020 Disentanglement Voice Conversion
Code Code Available 1Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining Dec 14, 2019 text-to-speech Text to Speech
Code Code Available 1Emotionless: Privacy-Preserving Speech Analysis for Voice Assistants Aug 9, 2019 Emotion Recognition Privacy Preserving
Code Code Available 1Non-Parallel Voice Conversion with Cyclic Variational Autoencoder Jul 24, 2019 Decoder Voice Conversion
Code Code Available 1Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion Jun 3, 2019 Audio Generation Voice Conversion
Code Code Available 1Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion May 2, 2019 Decoder Disentanglement
Code Code Available 1MOSNet: Deep Learning based Objective Assessment for Voice Conversion Apr 17, 2019 Deep Learning Voice Conversion
Code Code Available 1One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization Apr 10, 2019 Voice Conversion
Code Code Available 1Spectrogram-channels u-net: a source separation model viewing each channel as the spectrogram of each source Oct 26, 2018 Information Retrieval Music Information Retrieval
Code Code Available 1Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders Aug 29, 2018 Voice Conversion
Code Code Available 1StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks Jun 6, 2018 Attribute Generative Adversarial Network
Code Code Available 1RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding Jun 12, 2025 CPU Voice Conversion
— Unverified 0CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition Jun 6, 2025 Emotion Recognition Fairness
— Unverified 0Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion Jun 4, 2025 Disentanglement Style Transfer
— Unverified 0StarVC: A Unified Auto-Regressive Framework for Joint Text and Speech Generation in Voice Conversion Jun 3, 2025 Voice Conversion
— Unverified 0Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech Jun 2, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0LinearVC: Linear transformations of self-supervised features through the lens of voice conversion Jun 2, 2025 Voice Conversion
— Unverified 0SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction Jun 2, 2025 Speech Synthesis text-to-speech
— Unverified 0Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching Jun 1, 2025 Rhythm Style Transfer
— Unverified 0PseudoVC: Improving One-shot Voice Conversion with Pseudo Paired Data Jun 1, 2025 Voice Conversion
— Unverified 0A Perception-Based L2 Speech Intelligibility Indicator: Leveraging a Rater's Shadowing and Sequence-to-sequence Voice Conversion May 30, 2025 Voice Conversion
— Unverified 0Voice Conversion Improves Cross-Domain Robustness for Spoken Arabic Dialect Identification May 30, 2025 Dialect Identification Voice Conversion
— Unverified 0When Humans Growl and Birds Speak: High-Fidelity Voice Conversion from Human to Animal and Designed Sounds May 30, 2025 Voice Conversion
— Unverified 0Discl-VC: Disentangled Discrete Tokens and In-Context Learning for Controllable Zero-Shot Voice Conversion May 30, 2025 In-Context Learning Voice Conversion
— Unverified 0PromptEVC: Controllable Emotional Voice Conversion with Natural Language Prompts May 27, 2025 Diversity Rhythm
— Unverified 0REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion May 27, 2025 Disentanglement Speaker Identification
— Unverified 0VibE-SVC: Vibrato Extraction with High-frequency F0 Contour for Singing Voice Conversion May 27, 2025 Voice Conversion
— Unverified 0ArVoice: A Multi-Speaker Dataset for Arabic Speech Synthesis May 26, 2025 DeepFake Detection Face Swapping
— Unverified 0Eta-WavLM: Efficient Speaker Identity Removal in Self-Supervised Speech Representations Using a Simple Linear Equation May 25, 2025 Disentanglement Self-Supervised Learning
— Unverified 0Private kNN-VC: Interpretable Anonymization of Converted Speech May 23, 2025 Speaker anonymization Speaker Recognition
Code Code Available 0EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion May 22, 2025 Decoder Voice Conversion
— Unverified 0ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech May 20, 2025 Voice Conversion
— Unverified 0Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages May 20, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Investigating self-supervised features for expressive, multilingual voice conversion May 13, 2025 Self-Supervised Learning Speech Synthesis
— Unverified 0Discrete Optimal Transport and Voice Conversion May 7, 2025 Audio Generation Voice Conversion
— Unverified 0Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements Apr 27, 2025 Generative Adversarial Network Speech Synthesis
— Unverified 0