FSD: An Initial Chinese Dataset for Fake Song Detection Sep 5, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 15 Vector-quantized neural networks for acoustic unit discovery in the ZeroSpeech 2020 challenge May 19, 2020 Acoustic Unit Discovery Voice Conversion
Code Code Available 15 AutoVisual Fusion Suite: A Comprehensive Evaluation of Image Segmentation and Voice Conversion Tools on HuggingFace Platform Dec 17, 2023 Image Segmentation Segmentation
Code Code Available 15 Speak Like a Dog: Human to Non-human creature Voice Conversion Jun 9, 2022 Generative Adversarial Network Voice Conversion
Code Code Available 15 Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques Apr 2, 2021 Decoder Rhythm
Code Code Available 15 CSLP-AE: A Contrastive Split-Latent Permutation Autoencoder Framework for Zero-Shot Electroencephalography Signal Conversion Nov 13, 2023 Contrastive Learning EEG
Code Code Available 15 Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion Sep 5, 2023 Voice Conversion
Code Code Available 15 MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames Feb 25, 2021 Voice Conversion
Code Code Available 15 SpeechLMScore: Evaluating speech generation using speech language model Dec 8, 2022 Language Modeling Language Modelling
Code Code Available 15 Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion Aug 18, 2022 Disentanglement Rhythm
Code Code Available 15 Emotional Voice Conversion: Theory, Databases and ESD May 31, 2021 Voice Conversion
Code Code Available 15 A unified one-shot prosody and speaker conversion system with self-supervised discrete speech units Nov 12, 2022 Rhythm Voice Conversion
Code Code Available 15 Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data Feb 1, 2020 Voice Conversion
Code Code Available 15 StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion Jul 21, 2021 Generative Adversarial Network text-to-speech
Code Code Available 15 Emo-StarGAN: A Semi-Supervised Any-to-Many Non-Parallel Emotion-Preserving Voice Conversion Sep 14, 2023 Voice Conversion
Code Code Available 15 StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings Sep 14, 2023 Generative Adversarial Network Voice Conversion
Code Code Available 15 F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder Apr 15, 2020 Style Transfer Voice Conversion
Code Code Available 15 Deep Learning Based Assessment of Synthetic Speech Naturalness Apr 23, 2021 Deep Learning Prediction
Code Code Available 15 Toward Degradation-Robust Voice Conversion Oct 14, 2021 Denoising Speech Enhancement
Code Code Available 15 The Singing Voice Conversion Challenge 2023 Jun 26, 2023 Voice Conversion
Code Code Available 15 Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion Jan 22, 2020 Disentanglement Voice Conversion
Code Code Available 15 Towards Improved Zero-shot Voice Conversion with Conditional DSVAE May 11, 2022 Voice Conversion
Code Code Available 15 Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders Aug 29, 2018 Voice Conversion
Code Code Available 15 Defending Your Voice: Adversarial Attack on Voice Conversion May 18, 2020 Adversarial Attack Voice Conversion
Code Code Available 15 Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining Dec 14, 2019 text-to-speech Text to Speech
Code Code Available 15 An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation Jul 18, 2021 Data Augmentation Emotion Recognition
Code Code Available 05 A Speech Representation Anonymization Framework via Selective Noise Perturbation Mar 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised Learning Jun 5, 2020 Self-Supervised Learning Speaker Verification
Code Code Available 05 Deep Residual Neural Networks for Audio Spoofing Detection Jun 30, 2019 Speaker Verification Speech Synthesis
Code Code Available 05 Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages May 20, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS Oct 6, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion Oct 16, 2020 Speech Synthesis text-to-speech
Code Code Available 05 Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks Sep 23, 2017 Speech Synthesis text-to-speech
Code Code Available 05 STC Antispoofing Systems for the ASVspoof2019 Challenge Apr 11, 2019 Speech Synthesis Voice Conversion
Code Code Available 05 Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks Oct 4, 2021 Decoder Voice Conversion
Code Code Available 05 Spoof detection using time-delay shallow neural network and feature switching Apr 16, 2019 Speaker Verification Speech Synthesis
Code Code Available 05 StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion Jul 29, 2019 Voice Conversion
Code Code Available 05 AdaGAN: Adaptive GAN for Many-to-Many Non-Parallel Voice Conversion Sep 25, 2019 Generative Adversarial Network Style Transfer
Code Code Available 05 SVSNet: An End-to-end Speaker Voice Similarity Assessment Model Jul 20, 2021 Voice Conversion Voice Similarity
Code Code Available 05 CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion Apr 9, 2019 Voice Conversion
Code Code Available 05 ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks Apr 1, 2019 Feature Engineering text-to-speech
Code Code Available 05 SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines Nov 6, 2021 Disentanglement Speaker Verification
Code Code Available 05 ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder Aug 13, 2018 Attribute Decoder
Code Code Available 05 Scalable Factorized Hierarchical Variational Autoencoder Training Apr 9, 2018 Disentanglement Hyperparameter Optimization
Code Code Available 05 Parallel-Data-Free Voice Conversion Using Cycle-Consistent Adversarial Networks Nov 30, 2017 Voice Conversion
Code Code Available 05 Playing with Voices: Tabletop Role-Playing Game Recordings as a Diarization Challenge Feb 18, 2025 Voice Conversion
Code Code Available 05 NVC-Net: End-to-End Adversarial Voice Conversion Jun 2, 2021 GPU Speech Synthesis
Code Code Available 05 Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis Feb 28, 2020 Speech Synthesis text-to-speech
Code Code Available 05 A Practical Guide to Logical Access Voice Presentation Attack Detection Jan 10, 2022 Artifact Detection Speaker Verification
Code Code Available 05 Multi-task learning improves synthetic speech detection Apr 27, 2022 Multi-Task Learning Speaker Verification
Code Code Available 05