Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph Feb 24, 2022 Decoder Quantization
Code Code Available 1Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques Apr 2, 2021 Decoder Rhythm
Code Code Available 1Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice Conversion Mar 30, 2022 Data Augmentation Decoder
Code Code Available 1Robust Training of Vector Quantized Bottleneck Models May 18, 2020 Clustering Disentanglement
Code Code Available 1CSLP-AE: A Contrastive Split-Latent Permutation Autoencoder Framework for Zero-Shot Electroencephalography Signal Conversion Nov 13, 2023 Contrastive Learning EEG
Code Code Available 1MOSNet: Deep Learning based Objective Assessment for Voice Conversion Apr 17, 2019 Deep Learning Voice Conversion
Code Code Available 1Where are we in audio deepfake detection? A systematic analysis over generative and detection models Oct 6, 2024 Audio Deepfake Detection Audio Synthesis
Code Code Available 1Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units Dec 19, 2022 Rhythm Voice Conversion
Code Code Available 1Investigation of F0 conditioning and Fully Convolutional Networks in Variational Autoencoder based Voice Conversion May 2, 2019 Decoder Disentanglement
Code Code Available 1SpeechLMScore: Evaluating speech generation using speech language model Dec 8, 2022 Language Modeling Language Modelling
Code Code Available 1HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods Sep 15, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 1Speech Resynthesis from Discrete Disentangled Self-Supervised Representations Apr 1, 2021 Disentanglement Representation Learning
Code Code Available 1GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models Oct 11, 2022 Disentanglement Generative Adversarial Network
Code Code Available 1FSD: An Initial Chinese Dataset for Fake Song Detection Sep 5, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 1HiFi-VC: High Quality ASR-Based Voice Conversion Mar 31, 2022 speech-recognition Speech Recognition
Code Code Available 1kNN-SVC: Robust Zero-Shot Singing Voice Conversion with Additive Synthesis and Concatenation Smoothness Optimization Apr 8, 2025 Voice Conversion
Code Code Available 1Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations Oct 27, 2021 Voice Conversion
Code Code Available 1FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation Nov 11, 2020 Voice Conversion
Code Code Available 1Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss Apr 22, 2021 Voice Cloning Voice Conversion
Code Code Available 1End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions May 19, 2022 Speech Synthesis Style Transfer
Code Code Available 1Emo-StarGAN: A Semi-Supervised Any-to-Many Non-Parallel Emotion-Preserving Voice Conversion Sep 14, 2023 Voice Conversion
Code Code Available 1Efficient Non-Autoregressive GAN Voice Conversion using VQWav2vec Features and Dynamic Convolution Mar 31, 2022 Voice Conversion
Code Code Available 1Emotional Voice Conversion: Theory, Databases and ESD May 31, 2021 Voice Conversion
Code Code Available 1BiSinger: Bilingual Singing Voice Synthesis Sep 25, 2023 Singing Voice Synthesis text-to-speech
Code Code Available 1A Comparative Study of Self-supervised Speech Representation Based Voice Conversion Jul 10, 2022 Voice Conversion
Code Code Available 1Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling Sep 6, 2020 feature selection speech-recognition
Code Code Available 1Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion Sep 5, 2023 Voice Conversion
Code Code Available 1F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder Apr 15, 2020 Style Transfer Voice Conversion
Code Code Available 1DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model Jun 18, 2023 Data Augmentation Decoder
Code Code Available 1CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion Aug 18, 2020 Prediction Voice Conversion
Code Code Available 1Emotionless: Privacy-Preserving Speech Analysis for Voice Assistants Aug 9, 2019 Emotion Recognition Privacy Preserving
Code Code Available 1Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline Nov 29, 2022 Voice Conversion
Code Code Available 1FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection Oct 18, 2021 Speech Synthesis Synthetic Speech Detection
Code Code Available 1Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN Oct 9, 2020 Generative Adversarial Network Task 2
Code Code Available 1Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme Sep 28, 2021 Speech Synthesis Voice Conversion
Code Code Available 1Improving fairness for spoken language understanding in atypical speech with Text-to-Speech Nov 16, 2023 Data Augmentation Fairness
Code Code Available 1Defending Your Voice: Adversarial Attack on Voice Conversion May 18, 2020 Adversarial Attack Voice Conversion
Code Code Available 1LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech Oct 18, 2021 Voice Conversion
Code Code Available 1Controllable and Interpretable Singing Voice Decomposition via Assem-VC Oct 25, 2021 Voice Conversion
Code Code Available 1ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Speed Sep 23, 2022 Pitch control Speech Synthesis
Code Code Available 1Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques Aug 5, 2023 Quantization Speaker anonymization
Code Code Available 1DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion Sep 9, 2022 De-identification Speaker Verification
Code Code Available 1MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames Feb 25, 2021 Voice Conversion
Code Code Available 1Disentanglement in a GAN for Unconditional Speech Synthesis Jul 4, 2023 Disentanglement Generative Adversarial Network
Code Code Available 1AutoVisual Fusion Suite: A Comprehensive Evaluation of Image Segmentation and Voice Conversion Tools on HuggingFace Platform Dec 17, 2023 Image Segmentation Segmentation
Code Code Available 1CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer Nov 30, 2021 Voice Conversion
Code Code Available 1Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion Jun 3, 2019 Audio Generation Voice Conversion
Code Code Available 1crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder Mar 4, 2021 Voice Conversion
Code Code Available 1A unified one-shot prosody and speaker conversion system with self-supervised discrete speech units Nov 12, 2022 Rhythm Voice Conversion
Code Code Available 1CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion Oct 22, 2020 Voice Conversion
Code Code Available 1