DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding May 21, 2023 Data Augmentation Decoder
— Unverified 00 Effects of Convolutional Autoencoder Bottleneck Width on StarGAN-based Singing Technique Conversion Aug 19, 2023 Voice Conversion
— Unverified 00 EmoAttack: Utilizing Emotional Voice Conversion for Speech Backdoor Attacks on Deep Speech Classification Models Aug 28, 2024 Attribute Backdoor Attack
— Unverified 00 EmoCat: Language-agnostic Emotional Voice Conversion Jan 14, 2021 Decoder text-to-speech
— Unverified 00 EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion Dec 29, 2024 Self-Supervised Learning Voice Conversion
— Unverified 00 Emotion Intensity and its Control for Emotional Voice Conversion Jan 10, 2022 Emotion Classification Voice Conversion
— Unverified 00 End-to-End Voice Conversion with Information Perturbation Jun 15, 2022 Voice Conversion
— Unverified 00 Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations Feb 5, 2024 Decoder In-Context Learning
— Unverified 00 Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention VAE Mar 30, 2022 Decoder Sentence
— Unverified 00 Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion Jun 16, 2021 Style Transfer Voice Conversion
— Unverified 00 Error Reduction Network for DBLSTM-based Voice Conversion Sep 26, 2018 Voice Conversion
— Unverified 00 Eta-WavLM: Efficient Speaker Identity Removal in Self-Supervised Speech Representations Using a Simple Linear Equation May 25, 2025 Disentanglement Self-Supervised Learning
— Unverified 00 Evaluating Voice Conversion-based Privacy Protection against Informed Attackers Nov 10, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Evaluation of Speaker Anonymization on Emotional Speech Apr 15, 2023 Automatic Speech Recognition Emotion Recognition
— Unverified 00 Exploring data augmentation in bias mitigation against non-native-accented speech Dec 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Exploring synthetic data for cross-speaker style transfer in style representation based TTS Sep 25, 2024 Style Transfer text-to-speech
— Unverified 00 Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models Oct 13, 2021 Resynthesis Speaker anonymization
— Unverified 00 Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features Nov 9, 2022 Decoder Voice Conversion
— Unverified 00 Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer Jul 8, 2021 Emotion Recognition Speech Emotion Recognition
— Unverified 00 EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion May 22, 2025 Decoder Voice Conversion
— Unverified 00 Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment Sep 18, 2023 Voice Conversion
— Unverified 00 Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos Jun 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 FADEL: Uncertainty-aware Fake Audio Detection with Evidential Deep Learning Apr 22, 2025 Deep Learning Speaker Verification
— Unverified 00 Fake the Real: Backdoor Attack on Deep Speech Classification via Voice Conversion Jun 28, 2023 Backdoor Attack Voice Conversion
— Unverified 00 FastVC: Fast Voice Conversion with non-parallel data Oct 8, 2020 Voice Conversion
— Unverified 00 FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation Sep 3, 2024 Voice Conversion
— Unverified 00 FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks Feb 6, 2025 Resynthesis Voice Conversion
— Unverified 00 Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation Jan 24, 2025 Audio Deepfake Detection DeepFake Detection
— Unverified 00 Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations Aug 24, 2023 Representation Learning Speech Synthesis
— Unverified 00 Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion Jul 27, 2019 Voice Conversion
— Unverified 00 Generating and Detecting Various Types of Fake Image and Audio Content: A Review of Modern Deep Learning Technologies and Tools Jan 7, 2025 Face Swapping Voice Conversion
— Unverified 00 Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements Apr 27, 2025 Generative Adversarial Network Speech Synthesis
— Unverified 00 GenVC: Self-Supervised Zero-Shot Voice Conversion Feb 6, 2025 Voice Conversion
— Unverified 00 GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion Jul 4, 2022 Voice Conversion
— Unverified 00 GPU-Friendly Local Regression for Voice Conversion May 1, 2015 CPU GPU
— Unverified 00 Hierarchical disentangled representation learning for singing voice conversion Jan 18, 2021 Representation Learning Voice Conversion
— Unverified 00 Hierarchical Sequence to Sequence Voice Conversion with Limited Data Jul 15, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 High Fidelity Speech Regeneration with Application to Speech Enhancement Jan 31, 2021 Denoising Speaker Separation
— Unverified 00 High-quality nonparallel voice conversion based on cycle-consistent adversarial network Apr 2, 2018 Generative Adversarial Network Image-to-Image Translation
— Unverified 00 HLTCOE JHU Submission to the Voice Privacy Challenge 2024 Sep 13, 2024 text-to-speech Text to Speech
— Unverified 00 How Far Are We from Robust Voice Conversion: A Survey Nov 24, 2020 Speaker Identification Survey
— Unverified 00 Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems Jun 18, 2022 Speaker Identification Speaker Verification
— Unverified 00 Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion Oct 20, 2021 Disentanglement Voice Conversion
— Unverified 00 Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder Nov 15, 2022 Contrastive Learning Disentanglement
— Unverified 00 Improve few-shot voice cloning using multi-modal learning Mar 18, 2022 text-to-speech Text to Speech
— Unverified 00 Improving child speech recognition with augmented child-like speech Jun 12, 2024 speech-recognition Speech Recognition
— Unverified 00 Improving Voice Conversion for Dissimilar Speakers Using Perceptual Losses Sep 15, 2023 Speaker Verification Voice Conversion
— Unverified 00 Improving Voice Quality in Speech Anonymization With Just Perception-Informed Losses Oct 20, 2024 Voice Conversion
— Unverified 00 Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning Mar 17, 2021 Decoder Representation Learning
— Unverified 00