DNN-based cross-lingual voice conversion using Bottleneck Features Sep 9, 2019 Voice Conversion
— Unverified 0DreamVoice: Text-Guided Voice Conversion Jun 24, 2024 text-guided-generation Voice Conversion
— Unverified 0DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion Sep 27, 2023 Decoder Knowledge Distillation
— Unverified 0DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion Jun 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding May 21, 2023 Data Augmentation Decoder
— Unverified 0Effects of Convolutional Autoencoder Bottleneck Width on StarGAN-based Singing Technique Conversion Aug 19, 2023 Voice Conversion
— Unverified 0EmoAttack: Utilizing Emotional Voice Conversion for Speech Backdoor Attacks on Deep Speech Classification Models Aug 28, 2024 Attribute Backdoor Attack
— Unverified 0EmoCat: Language-agnostic Emotional Voice Conversion Jan 14, 2021 Decoder text-to-speech
— Unverified 0EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion Dec 29, 2024 Self-Supervised Learning Voice Conversion
— Unverified 0Emotion Intensity and its Control for Emotional Voice Conversion Jan 10, 2022 Emotion Classification Voice Conversion
— Unverified 0End-to-End Voice Conversion with Information Perturbation Jun 15, 2022 Voice Conversion
— Unverified 0Enhancing the Stability of LLM-based Speech Generation Systems through Self-Supervised Representations Feb 5, 2024 Decoder In-Context Learning
— Unverified 0Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention VAE Mar 30, 2022 Decoder Sentence
— Unverified 0Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion Jun 16, 2021 Style Transfer Voice Conversion
— Unverified 0Error Reduction Network for DBLSTM-based Voice Conversion Sep 26, 2018 Voice Conversion
— Unverified 0Eta-WavLM: Efficient Speaker Identity Removal in Self-Supervised Speech Representations Using a Simple Linear Equation May 25, 2025 Disentanglement Self-Supervised Learning
— Unverified 0Evaluating Voice Conversion-based Privacy Protection against Informed Attackers Nov 10, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Evaluation of Speaker Anonymization on Emotional Speech Apr 15, 2023 Automatic Speech Recognition Emotion Recognition
— Unverified 0Exploring data augmentation in bias mitigation against non-native-accented speech Dec 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring synthetic data for cross-speaker style transfer in style representation based TTS Sep 25, 2024 Style Transfer text-to-speech
— Unverified 0Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models Oct 13, 2021 Resynthesis Speaker anonymization
— Unverified 0Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features Nov 9, 2022 Decoder Voice Conversion
— Unverified 0Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer Jul 8, 2021 Emotion Recognition Speech Emotion Recognition
— Unverified 0EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion May 22, 2025 Decoder Voice Conversion
— Unverified 0Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment Sep 18, 2023 Voice Conversion
— Unverified 0Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos Jun 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0FADEL: Uncertainty-aware Fake Audio Detection with Evidential Deep Learning Apr 22, 2025 Deep Learning Speaker Verification
— Unverified 0Fake the Real: Backdoor Attack on Deep Speech Classification via Voice Conversion Jun 28, 2023 Backdoor Attack Voice Conversion
— Unverified 0Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART Mar 1, 2024 Retrieval Translation
— Unverified 0Transfer the linguistic representations from TTS to accent conversion with non-parallel data Jan 7, 2024 text-to-speech Text to Speech
— Unverified 0Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis Jul 25, 2022 Data Augmentation Speech Synthesis
— Unverified 0Two-Stage Voice Anonymization for Enhanced Privacy Jun 28, 2023 Voice Conversion
— Unverified 0UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion Jan 10, 2023 Quantization text-to-speech
— Unverified 0Unsupervised Cross-Domain Singing Voice Conversion Aug 6, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR Jan 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised Singing Voice Conversion Apr 13, 2019 Data Augmentation Decoder
— Unverified 0Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset Sep 14, 2022 text-to-speech Text to Speech
— Unverified 0USTC-KXDIGIT System Description for ASVspoof5 Challenge Sep 3, 2024 DeepFake Detection Face Swapping
— Unverified 0V2S attack: building DNN-based voice conversion from automatic speaker verification Aug 5, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech Nov 3, 2020 Decoder Disentanglement
— Unverified 0VAW-GAN for Singing Voice Conversion with Non-parallel Training Data Aug 10, 2020 Decoder Generative Adversarial Network
— Unverified 0VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion Sep 10, 2024 Bandwidth Extension Voice Conversion
— Unverified 0V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion Feb 18, 2022 Quantization Speech Synthesis
— Unverified 0vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders Sep 3, 2024 Speech Synthesis Voice Conversion
— Unverified 0Versatile Speech Databases for High Quality Synthesis for Basque May 1, 2012 Emotional Speech Synthesis Speech Synthesis
— Unverified 0Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement Feb 11, 2025 Disentanglement text-to-speech
— Unverified 0VibE-SVC: Vibrato Extraction with High-frequency F0 Contour for Singing Voice Conversion May 27, 2025 Voice Conversion
— Unverified 0VITS-Based Singing Voice Conversion Leveraging Whisper and multi-scale F0 Modeling Oct 4, 2023 Decoder Voice Conversion
— Unverified 0Voice Conversion Augmentation for Speaker Recognition on Defective Datasets Apr 1, 2024 Speaker Recognition Voice Conversion
— Unverified 0