ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods Sep 15, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 1Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline Nov 29, 2022 Voice Conversion
Code Code Available 1Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders Aug 29, 2018 Voice Conversion
Code Code Available 1Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques Aug 5, 2023 Quantization Speaker anonymization
Code Code Available 1CSLP-AE: A Contrastive Split-Latent Permutation Autoencoder Framework for Zero-Shot Electroencephalography Signal Conversion Nov 13, 2023 Contrastive Learning EEG
Code Code Available 1Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training Mar 31, 2021 text-to-speech Text to Speech
Code Code Available 1VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion Jun 18, 2021 Disentanglement Quantization
Code Code Available 1What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection Dec 15, 2023 Audio Deepfake Detection Continual Learning
Code Code Available 1YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone Dec 4, 2021 Speech Synthesis Text-To-Speech Synthesis
Code Code Available 1CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion Oct 22, 2020 Voice Conversion
Code Code Available 1CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer Nov 30, 2021 Voice Conversion
Code Code Available 1AutoVisual Fusion Suite: A Comprehensive Evaluation of Image Segmentation and Voice Conversion Tools on HuggingFace Platform Dec 17, 2023 Image Segmentation Segmentation
Code Code Available 1MOSNet: Deep Learning based Objective Assessment for Voice Conversion Apr 17, 2019 Deep Learning Voice Conversion
Code Code Available 1FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection Oct 18, 2021 Speech Synthesis Synthetic Speech Detection
Code Code Available 1FSD: An Initial Chinese Dataset for Fake Song Detection Sep 5, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 1F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder Apr 15, 2020 Style Transfer Voice Conversion
Code Code Available 1Deep Learning Based Assessment of Synthetic Speech Naturalness Apr 23, 2021 Deep Learning Prediction
Code Code Available 1Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion Sep 5, 2023 Voice Conversion
Code Code Available 1FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation Nov 11, 2020 Voice Conversion
Code Code Available 1GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models Oct 11, 2022 Disentanglement Generative Adversarial Network
Code Code Available 1Low-Latency Real-Time Non-Parallel Voice Conversion based on Cyclic Variational Autoencoder and Multiband WaveRNN with Data-Driven Linear Prediction May 20, 2021 CPU Voice Conversion
Code Code Available 1Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph Feb 24, 2022 Decoder Quantization
Code Code Available 1Defending Your Voice: Adversarial Attack on Voice Conversion May 18, 2020 Adversarial Attack Voice Conversion
Code Code Available 1Toward Degradation-Robust Voice Conversion Oct 14, 2021 Denoising Speech Enhancement
Code Code Available 1Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints Nov 16, 2022 Voice Conversion
— Unverified 0A Unified Model For Voice and Accent Conversion In Speech and Singing using Self-Supervised Learning and Feature Extraction Dec 11, 2024 Decoder Self-Supervised Learning
— Unverified 0Adaptive Speech Duration Modification using a Deep-Generative Framework Sep 29, 2021 Decoder Dynamic Time Warping
— Unverified 0DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices Aug 15, 2020 Speaker Recognition Voice Conversion
— Unverified 0Audio Deep Fake Detection System with Neural Stitching for ADD 2022 Apr 19, 2022 text-to-speech Text to Speech
— Unverified 0An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR Mar 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling Aug 9, 2020 Deep Learning Speech Synthesis
— Unverified 0Deep Learning-based F0 Synthesis for Speaker Anonymization Jun 29, 2023 Deep Learning Speaker anonymization
— Unverified 0Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning Nov 17, 2022 Binary Classification Meta-Learning
— Unverified 0Many-to-Many Voice Conversion with Out-of-Dataset Speaker Support Apr 30, 2019 Speaker Identification Voice Conversion
— Unverified 0End-to-End Voice Conversion with Information Perturbation Jun 15, 2022 Voice Conversion
— Unverified 0DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding Oct 13, 2021 Speech Synthesis Voice Conversion
— Unverified 0AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms Nov 9, 2018 GPU Image Captioning
— Unverified 0Attentive activation function for improving end-to-end spoofing countermeasure systems May 3, 2022 Speech Synthesis Voice Conversion
— Unverified 0Analysis of Voice Conversion and Code-Switching Synthesis Using VQ-VAE Mar 28, 2022 Speech Synthesis Voice Conversion
— Unverified 0ACE-VC: Adaptive and Controllable Voice Conversion using Explicitly Disentangled Self-supervised Speech Representations Feb 16, 2023 Self-Supervised Learning Speaker Verification
— Unverified 0D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack Sep 11, 2024 Adversarial Attack Audio Synthesis
— Unverified 0Data Augmentation for Diverse Voice Conversion in Noisy Environments May 18, 2023 Data Augmentation Decoder
— Unverified 0Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion Dec 29, 2023 Contrastive Learning Disentanglement
— Unverified 0ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech Feb 13, 2025 Adversarial Attack Adversarial Attack Detection
— Unverified 0An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion Apr 25, 2021 Generative Adversarial Network Speech Synthesis
— Unverified 0Emotion Intensity and its Control for Emotional Voice Conversion Jan 10, 2022 Emotion Classification Voice Conversion
— Unverified 0ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech Feb 11, 2021 Speaker Verification Speech Synthesis
— Unverified 0CycleFlow: Purify Information Factors by Cycle Loss Oct 18, 2021 Voice Conversion
— Unverified 0AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion Nov 12, 2021 Voice Conversion
— Unverified 0