GPU-Friendly Local Regression for Voice Conversion May 1, 2015 CPU GPU
Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech Sep 15, 2023 Knowledge Distillation Speech Synthesis
SongBsAb: A Dual Prevention Approach against Singing Voice Conversion based Illegal Song Covers Jan 30, 2024 Voice Conversion
Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment Sep 18, 2023 Voice Conversion
Hierarchical Sequence to Sequence Voice Conversion with Limited Data Jul 15, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion Dec 6, 2021 Decoder Voice Conversion
EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion May 22, 2025 Decoder Voice Conversion
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer Jul 8, 2021 Emotion Recognition Speech Emotion Recognition
High Fidelity Speech Regeneration with Application to Speech Enhancement Jan 31, 2021 Denoising Speaker Separation
High-quality nonparallel voice conversion based on cycle-consistent adversarial network Apr 2, 2018 Generative Adversarial Network Image-to-Image Translation
Comparison of Speech Representations for the MOS Prediction System Jun 28, 2022 Self-Supervised Learning text-to-speech
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion Jun 2, 2021 Voice Conversion
Adversarial speech for voice privacy protection from Personalized Speech generation Jan 22, 2024 Speaker Verification text-to-speech
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems Jun 18, 2022 Speaker Identification Speaker Verification
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion Oct 20, 2021 Disentanglement Voice Conversion
Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder Nov 15, 2022 Contrastive Learning Disentanglement
Improve few-shot voice cloning using multi-modal learning Mar 18, 2022 text-to-speech Text to Speech
Improving child speech recognition with augmented child-like speech Jun 12, 2024 speech-recognition Speech Recognition
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features Nov 9, 2022 Decoder Voice Conversion
Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models Oct 13, 2021 Resynthesis Speaker anonymization
Improving Voice Conversion for Dissimilar Speakers Using Perceptual Losses Sep 15, 2023 Speaker Verification Voice Conversion
Improving Voice Quality in Speech Anonymization With Just Perception-Informed Losses Oct 20, 2024 Voice Conversion
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion May 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Exploring synthetic data for cross-speaker style transfer in style representation based TTS Sep 25, 2024 Style Transfer text-to-speech
Exploring data augmentation in bias mitigation against non-native-accented speech Dec 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Individuality-Preserving Voice Conversion for Articulation Disorders Using Locality-Constrained NMF Aug 1, 2013 Speech Recognition Voice Conversion
Invertible Voice Conversion Jan 26, 2022 Voice Conversion
Investigating Inter- and Intra-speaker Voice Conversion using Audiobooks Jun 1, 2022 Speech Synthesis text-to-speech
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack Sep 11, 2024 Adversarial Attack Audio Synthesis
Investigating self-supervised features for expressive, multilingual voice conversion May 13, 2025 Self-Supervised Learning Speech Synthesis
A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023 Oct 8, 2023 Self-Supervised Learning Task 2
Evaluation of Speaker Anonymization on Emotional Speech Apr 15, 2023 Automatic Speech Recognition Emotion Recognition
Investigation of using disentangled and interpretable representations with language conditioning for cross-lingual voice conversion Oct 22, 2018 One-Shot Learning Voice Conversion
IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion Jan 2, 2022 Quantization Voice Conversion
Evaluating Voice Conversion-based Privacy Protection against Informed Attackers Nov 10, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Iteratively Improving Speech Recognition and Voice Conversion May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection Oct 31, 2022 Audio Compression Face Swapping
Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet Mar 29, 2019 Decoder Speech Synthesis
Eta-WavLM: Efficient Speaker Identity Removal in Self-Supervised Speech Representations Using a Simple Linear Equation May 25, 2025 Disentanglement Self-Supervised Learning
Latent linguistic embedding for cross-lingual text-to-speech and voice conversion Oct 8, 2020 text-to-speech Text to Speech
LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation Aug 22, 2024 Voice Conversion
LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance Jun 8, 2024 Voice Conversion
Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion Apr 18, 2025 Generative Adversarial Network Image Generation
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion Nov 3, 2020 speech-recognition Speech Recognition
Learning Paralinguistic Features from Audiobooks through Style Voice Conversion Jun 1, 2021 Emotion Recognition Style Detection
Learning Singing From Speech Dec 20, 2019 Speech Synthesis Voice Conversion
Application of ASV for Voice Identification after VC and Duration Predictor Improvement in TTS Models Jun 27, 2024 Speaker Verification text-to-speech
AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge Aug 30, 2024 DeepFake Detection Face Swapping
Error Reduction Network for DBLSTM-based Voice Conversion Sep 26, 2018 Voice Conversion
ClsVC: Learning Speech Representations with two different classification tasks. Sep 29, 2021 Classification Vocal Bursts Valence Prediction