Boosting Star-GANs for Voice Conversion with Contrastive Discriminator Sep 21, 2022 Contrastive Learning Voice Conversion
— Unverified 00 Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech Sep 14, 2019 text-to-speech Text to Speech
— Unverified 00 Building Synthetic Voices in the META-NET Framework May 1, 2012 Speech Synthesis Voice Conversion
— Unverified 00 Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data Mar 2, 2018 Generative Adversarial Network Speech Enhancement
— Unverified 00 Catch You and I Can: Revealing Source Voiceprint Against Voice Conversion Feb 24, 2023 Representation Learning Speaker Verification
— Unverified 00 Change your singer: a transfer learning generative adversarial framework for song to song conversion Nov 7, 2019 Transfer Learning Voice Conversion
— Unverified 00 ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech May 20, 2025 Voice Conversion
— Unverified 00 ClsVC: Learning Speech Representations with two different classification tasks. Sep 29, 2021 Classification Vocal Bursts Valence Prediction
— Unverified 00 Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion Apr 18, 2025 Generative Adversarial Network Image Generation
— Unverified 00 Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection Oct 31, 2022 Audio Compression Face Swapping
— Unverified 00 Comparison of Speech Representations for the MOS Prediction System Jun 28, 2022 Self-Supervised Learning text-to-speech
— Unverified 00 Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion Dec 6, 2021 Decoder Voice Conversion
— Unverified 00 Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model May 2, 2024 Denoising Emotion Recognition
— Unverified 00 ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion Nov 5, 2018 Speech Enhancement Voice Conversion
— Unverified 00 CO-VADA: A Confidence-Oriented Voice Augmentation Debiasing Approach for Fair Speech Emotion Recognition Jun 6, 2025 Emotion Recognition Fairness
— Unverified 00 Learning Speech Representation From Contrastive Token-Acoustic Pretraining Sep 1, 2023 Audio Classification Automatic Speech Recognition
— Unverified 00 Creating New Voices using Normalizing Flows Dec 22, 2023 Speech Synthesis text-to-speech
— Unverified 00 Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models May 27, 2023 Speech Synthesis Voice Conversion
— Unverified 00 Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech Sep 15, 2023 Knowledge Distillation Speech Synthesis
— Unverified 00 Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation Oct 31, 2022 Decoder Disentanglement
— Unverified 00 Cross-modal Face- and Voice-style Transfer Feb 27, 2023 Diversity Image-to-Image Translation
— Unverified 00 Crossmodal Voice Conversion Apr 9, 2019 Decoder Voice Conversion
— Unverified 00 Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation Apr 21, 2022 Data Augmentation text-to-speech
— Unverified 00 Cross-speaker style transfer for text-to-speech using data augmentation Feb 10, 2022 Data Augmentation Style Transfer
— Unverified 00 CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching Nov 4, 2024 Speaker Verification Voice Conversion
— Unverified 00 Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion Nov 24, 2023 Data Augmentation Retrieval
— Unverified 00 CycleFlow: Purify Information Factors by Cycle Loss Oct 18, 2021 Voice Conversion
— Unverified 00 Data Augmentation for Diverse Voice Conversion in Noisy Environments May 18, 2023 Data Augmentation Decoder
— Unverified 00 D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack Sep 11, 2024 Adversarial Attack Audio Synthesis
— Unverified 00 DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding Oct 13, 2021 Speech Synthesis Voice Conversion
— Unverified 00 Deep Learning-based F0 Synthesis for Speaker Anonymization Jun 29, 2023 Deep Learning Speaker anonymization
— Unverified 00 Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling Aug 9, 2020 Deep Learning Speech Synthesis
— Unverified 00 DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices Aug 15, 2020 Speaker Recognition Voice Conversion
— Unverified 00 Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints Nov 16, 2022 Voice Conversion
— Unverified 00 Dictionary Update for NMF-based Voice Conversion Using an Encoder-Decoder Network Oct 13, 2016 Decoder Speech Enhancement
— Unverified 00 Differentiable WORLD Synthesizer-based Neural Vocoder With Application To End-To-End Audio Style Transfer Aug 15, 2022 Style Transfer Voice Conversion
— Unverified 00 ZSDEVC: Zero-Shot Diffusion-based Emotional Voice Conversion with Disentangled Mechanism Sep 5, 2024 Emotion Classification Voice Conversion
— Unverified 00 DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion May 28, 2021 Denoising Voice Conversion
— Unverified 00 Discl-VC: Disentangled Discrete Tokens and In-Context Learning for Controllable Zero-Shot Voice Conversion May 30, 2025 In-Context Learning Voice Conversion
— Unverified 00 Discrete Optimal Transport and Voice Conversion May 7, 2025 Audio Generation Voice Conversion
— Unverified 00 Discrete Unit based Masking for Improving Disentanglement in Voice Conversion Sep 17, 2024 Decoder Disentanglement
— Unverified 00 DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion Oct 20, 2022 Voice Conversion
— Unverified 00 Disentangled Feature Learning for Real-Time Neural Speech Coding Nov 22, 2022 Disentanglement Representation Learning
— Unverified 00 Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective Apr 5, 2022 Disentanglement Representation Learning
— Unverified 00 Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE Oct 25, 2022 Disentanglement Representation Learning
— Unverified 00 Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Disentangling Prosody Representations with Unsupervised Speech Reconstruction Dec 14, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 DNN-based cross-lingual voice conversion using Bottleneck Features Sep 9, 2019 Voice Conversion
— Unverified 00 DreamVoice: Text-Guided Voice Conversion Jun 24, 2024 text-guided-generation Voice Conversion
— Unverified 00 DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion Sep 27, 2023 Decoder Knowledge Distillation
— Unverified 00