Region-Based Optimization in Continual Learning for Audio Deepfake Detection Dec 16, 2024 Audio Deepfake Detection Continual Learning
Code Code Available 1A Unified Model For Voice and Accent Conversion In Speech and Singing using Self-Supervised Learning and Feature Extraction Dec 11, 2024 Decoder Self-Supervised Learning
— Unverified 0StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching Dec 6, 2024 Voice Conversion
— Unverified 0Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities Nov 29, 2024 Representation Learning Self-Supervised Learning
— Unverified 0SKQVC: One-Shot Voice Conversion by K-Means Quantization with Self-Supervised Speech Representations Nov 25, 2024 Quantization Self-Supervised Learning
— Unverified 0Zero-shot Voice Conversion with Diffusion Transformers Nov 15, 2024 In-Context Learning Voice Conversion
Code Code Available 7CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching Nov 4, 2024 Speaker Verification Voice Conversion
— Unverified 0Audio Deepfake Detection with Self-Supervised XLS-R and SLS Classifier Oct 28, 2024 Audio Deepfake Detection Audio Generation
Code Code Available 2LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec Oct 21, 2024 Disentanglement Language Modeling
— Unverified 0Improving Voice Quality in Speech Anonymization With Just Perception-Informed Losses Oct 20, 2024 Voice Conversion
— Unverified 0Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example Oct 20, 2024 Voice Conversion
Code Code Available 0Improving Data Augmentation-based Cross-Speaker Style Transfer for TTS with Singing Voice, Style Filtering, and F0 Matching Oct 8, 2024 Data Augmentation Style Transfer
Code Code Available 4Where are we in audio deepfake detection? A systematic analysis over generative and detection models Oct 6, 2024 Audio Deepfake Detection Audio Synthesis
Code Code Available 1A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings Oct 3, 2024 speech-recognition Speech Recognition
— Unverified 0Takin-VC: Expressive Zero-Shot Voice Conversion via Adaptive Hybrid Content Encoding and Enhanced Timbre Modeling Oct 2, 2024 Voice Conversion
— Unverified 0Exploring synthetic data for cross-speaker style transfer in style representation based TTS Sep 25, 2024 Style Transfer text-to-speech
— Unverified 0Textless NLP -- Zero Resource Challenge with Low Resource Compute Sep 24, 2024 Acoustic Unit Discovery GPU
— Unverified 0Discrete Unit based Masking for Improving Disentanglement in Voice Conversion Sep 17, 2024 Decoder Disentanglement
— Unverified 0SafeEar: Content Privacy-Preserving Audio Deepfake Detection Sep 14, 2024 Audio Deepfake Detection DeepFake Detection
Code Code Available 2HLTCOE JHU Submission to the Voice Privacy Challenge 2024 Sep 13, 2024 text-to-speech Text to Speech
— Unverified 0LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling Sep 13, 2024 CPU Rhythm
— Unverified 0D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack Sep 11, 2024 Adversarial Attack Audio Synthesis
— Unverified 0VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion Sep 10, 2024 Bandwidth Extension Voice Conversion
— Unverified 0VoiceWukong: Benchmarking Deepfake Voice Detection Sep 10, 2024 Benchmarking Face Swapping
— Unverified 0Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder Sep 5, 2024 Disentanglement Voice Conversion
— Unverified 0ZSDEVC: Zero-Shot Diffusion-based Emotional Voice Conversion with Disentangled Mechanism Sep 5, 2024 Emotion Classification Voice Conversion
— Unverified 0FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation Sep 3, 2024 Voice Conversion
— Unverified 0Pureformer-VC: Non-parallel One-Shot Voice Conversion with Pure Transformer Blocks and Triplet Discriminative Training Sep 3, 2024 Decoder Disentanglement
— Unverified 0USTC-KXDIGIT System Description for ASVspoof5 Challenge Sep 3, 2024 DeepFake Detection Face Swapping
— Unverified 0vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders Sep 3, 2024 Speech Synthesis Voice Conversion
— Unverified 0Seeing Your Speech Style: A Novel Zero-Shot Identity-Disentanglement Face-based Voice Conversion Sep 1, 2024 Contrastive Learning Disentanglement
— Unverified 0Progressive Residual Extraction based Pre-training for Speech Representation Learning Aug 31, 2024 Emotion Recognition Representation Learning
— Unverified 0AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge Aug 30, 2024 DeepFake Detection Face Swapping
— Unverified 0EmoAttack: Utilizing Emotional Voice Conversion for Speech Backdoor Attacks on Deep Speech Classification Models Aug 28, 2024 Attribute Backdoor Attack
— Unverified 0MaskCycleGAN-based Whisper to Normal Speech Conversion Aug 27, 2024 Generative Adversarial Network Voice Conversion
— Unverified 0Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples Aug 23, 2024 Data Augmentation Meta-Learning
— Unverified 0LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation Aug 22, 2024 Voice Conversion
— Unverified 0Hear Your Face: Face-based voice conversion with F0 estimation Aug 19, 2024 Voice Conversion
Code Code Available 0VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing Aug 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MulliVC: Multi-lingual Voice Conversion With Cycle Consistency Aug 8, 2024 Voice Conversion
— Unverified 0StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion Aug 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Automatic Voice Identification after Speech Resynthesis using PPG Aug 5, 2024 Resynthesis Speaker Verification
— Unverified 0Towards Realistic Emotional Voice Conversion using Controllable Emotional Intensity Jul 20, 2024 Diversity Rhythm
— Unverified 0The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation Jul 16, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Source Tracing of Audio Deepfake Systems Jul 10, 2024 Face Swapping text-to-speech
— Unverified 0SaMoye: Zero-shot Singing Voice Conversion Model Based on Feature Disentanglement and Enhancement Jul 10, 2024 Disentanglement Voice Conversion
Code Code Available 2We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings Jul 5, 2024 Speaker Recognition Speech Synthesis
— Unverified 0Application of ASV for Voice Identification after VC and Duration Predictor Improvement in TTS Models Jun 27, 2024 Speaker Verification text-to-speech
— Unverified 0RefXVC: Cross-Lingual Voice Conversion with Enhanced Reference Leveraging Jun 24, 2024 Sentence Voice Conversion
— Unverified 0DreamVoice: Text-Guided Voice Conversion Jun 24, 2024 text-guided-generation Voice Conversion
— Unverified 0