Low-Latency Real-Time Non-Parallel Voice Conversion based on Cyclic Variational Autoencoder and Multiband WaveRNN with Data-Driven Linear Prediction May 20, 2021 CPU Voice Conversion
Code Code Available 15 Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques Apr 2, 2021 Decoder Rhythm
Code Code Available 15 S2VC: A Framework for Any-to-Any Voice Conversion with Self-Supervised Pretrained Representations Apr 7, 2021 Self-Supervised Learning Voice Conversion
Code Code Available 15 Rhythm Modeling for Voice Conversion Jul 12, 2023 Rhythm Voice Conversion
Code Code Available 15 FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection Oct 18, 2021 Speech Synthesis Synthetic Speech Detection
Code Code Available 15 Spectrogram-channels u-net: a source separation model viewing each channel as the spectrogram of each source Oct 26, 2018 Information Retrieval Music Information Retrieval
Code Code Available 15 Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice Conversion Mar 30, 2022 Data Augmentation Decoder
Code Code Available 15 MOSNet: Deep Learning based Objective Assessment for Voice Conversion Apr 17, 2019 Deep Learning Voice Conversion
Code Code Available 15 Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations Oct 27, 2021 Voice Conversion
Code Code Available 15 Nonparallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks Aug 27, 2020 Voice Conversion
Code Code Available 15 End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions May 19, 2022 Speech Synthesis Style Transfer
Code Code Available 15 Non-Parallel Voice Conversion with Cyclic Variational Autoencoder Jul 24, 2019 Decoder Voice Conversion
Code Code Available 15 Emotionless: Privacy-Preserving Speech Analysis for Voice Assistants Aug 9, 2019 Emotion Recognition Privacy Preserving
Code Code Available 15 F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder Apr 15, 2020 Style Transfer Voice Conversion
Code Code Available 15 HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods Sep 15, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 15 Robust Training of Vector Quantized Bottleneck Models May 18, 2020 Clustering Disentanglement
Code Code Available 15 Efficient Non-Autoregressive GAN Voice Conversion using VQWav2vec Features and Dynamic Convolution Mar 31, 2022 Voice Conversion
Code Code Available 15 Disentanglement in a GAN for Unconditional Speech Synthesis Jul 4, 2023 Disentanglement Generative Adversarial Network
Code Code Available 15 Pretraining Techniques for Sequence-to-Sequence Voice Conversion Aug 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion Sep 9, 2022 De-identification Speaker Verification
Code Code Available 15 Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme Sep 28, 2021 Speech Synthesis Voice Conversion
Code Code Available 15 BiSinger: Bilingual Singing Voice Synthesis Sep 25, 2023 Singing Voice Synthesis text-to-speech
Code Code Available 15 A Comparative Study of Self-supervised Speech Representation Based Voice Conversion Jul 10, 2022 Voice Conversion
Code Code Available 15 Defending Your Voice: Adversarial Attack on Voice Conversion May 18, 2020 Adversarial Attack Voice Conversion
Code Code Available 15 Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss Apr 22, 2021 Voice Cloning Voice Conversion
Code Code Available 15 Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling Sep 6, 2020 feature selection speech-recognition
Code Code Available 15 DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model Jun 18, 2023 Data Augmentation Decoder
Code Code Available 15 Emo-StarGAN: A Semi-Supervised Any-to-Many Non-Parallel Emotion-Preserving Voice Conversion Sep 14, 2023 Voice Conversion
Code Code Available 15 Region-Based Optimization in Continual Learning for Audio Deepfake Detection Dec 16, 2024 Audio Deepfake Detection Continual Learning
Code Code Available 15 CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion Aug 18, 2020 Prediction Voice Conversion
Code Code Available 15 Deep Learning Based Assessment of Synthetic Speech Naturalness Apr 23, 2021 Deep Learning Prediction
Code Code Available 15 Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion Sep 5, 2023 Voice Conversion
Code Code Available 15 FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation Nov 11, 2020 Voice Conversion
Code Code Available 15 Baseline System of Voice Conversion Challenge 2020 with Cyclic Variational Autoencoder and Parallel WaveGAN Oct 9, 2020 Generative Adversarial Network Task 2
Code Code Available 15 CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer Nov 30, 2021 Voice Conversion
Code Code Available 15 FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention Oct 27, 2020 Disentanglement Speaker Verification
Code Code Available 15 Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques Aug 5, 2023 Quantization Speaker anonymization
Code Code Available 15 FSD: An Initial Chinese Dataset for Fake Song Detection Sep 5, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 15 AutoVisual Fusion Suite: A Comprehensive Evaluation of Image Segmentation and Voice Conversion Tools on HuggingFace Platform Dec 17, 2023 Image Segmentation Segmentation
Code Code Available 15 ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Speed Sep 23, 2022 Pitch control Speech Synthesis
Code Code Available 15 Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion May 13, 2020 Decoder Voice Conversion
Code Code Available 15 AraBERT: Transformer-based Model for Arabic Language Understanding Feb 28, 2020 model named-entity-recognition
Code Code Available 15 CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion Oct 22, 2020 Voice Conversion
Code Code Available 15 Emotional Voice Conversion: Theory, Databases and ESD May 31, 2021 Voice Conversion
Code Code Available 15 Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning Jun 15, 2022 Attribute Emotion Classification
Code Code Available 15 Improving fairness for spoken language understanding in atypical speech with Text-to-Speech Nov 16, 2023 Data Augmentation Fairness
Code Code Available 15 Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion Jun 3, 2019 Audio Generation Voice Conversion
Code Code Available 15 crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder Mar 4, 2021 Voice Conversion
Code Code Available 15 A unified one-shot prosody and speaker conversion system with self-supervised discrete speech units Nov 12, 2022 Rhythm Voice Conversion
Code Code Available 15 CSLP-AE: A Contrastive Split-Latent Permutation Autoencoder Framework for Zero-Shot Electroencephalography Signal Conversion Nov 13, 2023 Contrastive Learning EEG
Code Code Available 15