AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion Nov 12, 2021 Voice Conversion
— Unverified 0Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion Nov 24, 2023 Data Augmentation Retrieval
— Unverified 0CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching Nov 4, 2024 Speaker Verification Voice Conversion
— Unverified 0ALO-VC: Any-to-any Low-latency One-shot Voice Conversion Jun 1, 2023 CPU Voice Conversion
— Unverified 0Automatic Voice Identification after Speech Resynthesis using PPG Aug 5, 2024 Resynthesis Speaker Verification
— Unverified 0Cross-speaker style transfer for text-to-speech using data augmentation Feb 10, 2022 Data Augmentation Style Transfer
— Unverified 0Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation Apr 21, 2022 Data Augmentation text-to-speech
— Unverified 0Crossmodal Voice Conversion Apr 9, 2019 Decoder Voice Conversion
— Unverified 0A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment Apr 23, 2018 Benchmarking Speaker Verification
— Unverified 0AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment May 8, 2023 cross-modal alignment Rhythm
— Unverified 0High-quality nonparallel voice conversion based on cycle-consistent adversarial network Apr 2, 2018 Generative Adversarial Network Image-to-Image Translation
— Unverified 0Cross-modal Face- and Voice-style Transfer Feb 27, 2023 Diversity Image-to-Image Translation
— Unverified 0Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation Oct 31, 2022 Decoder Disentanglement
— Unverified 0Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech Sep 15, 2023 Knowledge Distillation Speech Synthesis
— Unverified 0Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models May 27, 2023 Speech Synthesis Voice Conversion
— Unverified 0ArVoice: A Multi-Speaker Dataset for Arabic Speech Synthesis May 26, 2025 DeepFake Detection Face Swapping
— Unverified 0A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion Jun 28, 2022 Speaker Recognition Voice Conversion
— Unverified 0A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech May 8, 2018 Language Identification Speech Synthesis
— Unverified 0Generating and Detecting Various Types of Fake Image and Audio Content: A Review of Modern Deep Learning Technologies and Tools Jan 7, 2025 Face Swapping Voice Conversion
— Unverified 0AE-Flow: AutoEncoder Normalizing Flow Dec 27, 2023 text-to-speech Text to Speech
— Unverified 0Learning Speech Representation From Contrastive Token-Acoustic Pretraining Sep 1, 2023 Audio Classification Automatic Speech Recognition
— Unverified 0Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation Jan 24, 2025 Audio Deepfake Detection DeepFake Detection
— Unverified 0CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition Jun 6, 2025 Emotion Recognition Fairness
— Unverified 0High Fidelity Speech Regeneration with Application to Speech Enhancement Jan 31, 2021 Denoising Speaker Separation
— Unverified 0HLTCOE JHU Submission to the Voice Privacy Challenge 2024 Sep 13, 2024 text-to-speech Text to Speech
— Unverified 0Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder Nov 15, 2022 Contrastive Learning Disentanglement
— Unverified 0ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion Nov 5, 2018 Speech Enhancement Voice Conversion
— Unverified 0Are disentangled representations all you need to build speaker anonymization systems? Aug 22, 2022 All Automatic Speech Recognition
— Unverified 0FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation Sep 3, 2024 Voice Conversion
— Unverified 0FastVC: Fast Voice Conversion with non-parallel data Oct 8, 2020 Voice Conversion
— Unverified 0Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model May 2, 2024 Denoising Emotion Recognition
— Unverified 0Adversarial Transformation of Spoofing Attacks for Voice Biometrics Jan 4, 2022 Speaker Verification Voice Conversion
— Unverified 0Fake the Real: Backdoor Attack on Deep Speech Classification via Voice Conversion Jun 28, 2023 Backdoor Attack Voice Conversion
— Unverified 0FADEL: Uncertainty-aware Fake Audio Detection with Evidential Deep Learning Apr 22, 2025 Deep Learning Speaker Verification
— Unverified 0Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos Jun 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SongBsAb: A Dual Prevention Approach against Singing Voice Conversion based Illegal Song Covers Jan 30, 2024 Voice Conversion
— Unverified 0FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks Feb 6, 2025 Resynthesis Voice Conversion
— Unverified 0Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment Sep 18, 2023 Voice Conversion
— Unverified 0Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion Dec 6, 2021 Decoder Voice Conversion
— Unverified 0EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion May 22, 2025 Decoder Voice Conversion
— Unverified 0Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer Jul 8, 2021 Emotion Recognition Speech Emotion Recognition
— Unverified 0Comparison of Speech Representations for the MOS Prediction System Jun 28, 2022 Self-Supervised Learning text-to-speech
— Unverified 0Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations Aug 24, 2023 Representation Learning Speech Synthesis
— Unverified 0Generalization of Spectrum Differential based Direct Waveform Modification for Voice Conversion Jul 27, 2019 Voice Conversion
— Unverified 0A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion Jun 2, 2021 Voice Conversion
— Unverified 0Generative Adversarial Network based Voice Conversion: Techniques, Challenges, and Recent Advancements Apr 27, 2025 Generative Adversarial Network Speech Synthesis
— Unverified 0Adversarial speech for voice privacy protection from Personalized Speech generation Jan 22, 2024 Speaker Verification text-to-speech
— Unverified 0Creating New Voices using Normalizing Flows Dec 22, 2023 Speech Synthesis text-to-speech
— Unverified 0GenVC: Self-Supervised Zero-Shot Voice Conversion Feb 6, 2025 Voice Conversion
— Unverified 0Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features Nov 9, 2022 Decoder Voice Conversion
— Unverified 0