FADEL: Uncertainty-aware Fake Audio Detection with Evidential Deep Learning Apr 22, 2025 Deep Learning Speaker Verification
— Unverified 0Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion Apr 18, 2025 Generative Adversarial Network Image Generation
— Unverified 0Voice Conversion with Diverse Intonation using Conditional Variational Auto-Encoder Apr 16, 2025 Diversity Voice Conversion
— Unverified 0Mitigating Timbre Leakage with Universal Semantic Mapping Residual Block for Voice Conversion Apr 11, 2025 Voice Conversion
— Unverified 0An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR Mar 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Playing with Voices: Tabletop Role-Playing Game Recordings as a Diarization Challenge Feb 18, 2025 Voice Conversion
Code Code Available 0ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech Feb 13, 2025 Adversarial Attack Adversarial Attack Detection
— Unverified 0Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement Feb 11, 2025 Disentanglement text-to-speech
— Unverified 0Singing Voice Conversion with Accompaniment Using Self-Supervised Representation-Based Melody Features Feb 7, 2025 Melody Extraction Self-Supervised Learning
— Unverified 0GenVC: Self-Supervised Zero-Shot Voice Conversion Feb 6, 2025 Voice Conversion
— Unverified 0FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks Feb 6, 2025 Resynthesis Voice Conversion
— Unverified 0VoicePrompter: Robust Zero-Shot Voice Conversion with Voice Prompt and Conditional Flow Matching Jan 29, 2025 Decoder In-Context Learning
— Unverified 0Stepback: Enhanced Disentanglement for Voice Conversion via Multi-Task Learning Jan 26, 2025 Disentanglement Multi-Task Learning
— Unverified 0Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation Jan 24, 2025 Audio Deepfake Detection DeepFake Detection
— Unverified 0Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR Jan 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Synthesis along Perceptual Voice Quality Dimensions Jan 15, 2025 Expressive Speech Synthesis Speech Synthesis
— Unverified 0Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives Jan 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training Jan 8, 2025 In-Context Learning Voice Conversion
— Unverified 0Generating and Detecting Various Types of Fake Image and Audio Content: A Review of Modern Deep Learning Technologies and Tools Jan 7, 2025 Face Swapping Voice Conversion
— Unverified 0AdaptVC: High Quality Voice Conversion with Adaptive Learning Jan 2, 2025 Decoder Disentanglement
— Unverified 0EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion Dec 29, 2024 Self-Supervised Learning Voice Conversion
— Unverified 0A Unified Model For Voice and Accent Conversion In Speech and Singing using Self-Supervised Learning and Feature Extraction Dec 11, 2024 Decoder Self-Supervised Learning
— Unverified 0StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching Dec 6, 2024 Voice Conversion
— Unverified 0Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities Nov 29, 2024 Representation Learning Self-Supervised Learning
— Unverified 0SKQVC: One-Shot Voice Conversion by K-Means Quantization with Self-Supervised Speech Representations Nov 25, 2024 Quantization Self-Supervised Learning
— Unverified 0CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching Nov 4, 2024 Speaker Verification Voice Conversion
— Unverified 0LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec Oct 21, 2024 Disentanglement Language Modeling
— Unverified 0Improving Voice Quality in Speech Anonymization With Just Perception-Informed Losses Oct 20, 2024 Voice Conversion
— Unverified 0Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example Oct 20, 2024 Voice Conversion
Code Code Available 0A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings Oct 3, 2024 speech-recognition Speech Recognition
— Unverified 0Takin-VC: Expressive Zero-Shot Voice Conversion via Adaptive Hybrid Content Encoding and Enhanced Timbre Modeling Oct 2, 2024 Voice Conversion
— Unverified 0Exploring synthetic data for cross-speaker style transfer in style representation based TTS Sep 25, 2024 Style Transfer text-to-speech
— Unverified 0Textless NLP -- Zero Resource Challenge with Low Resource Compute Sep 24, 2024 Acoustic Unit Discovery GPU
— Unverified 0Discrete Unit based Masking for Improving Disentanglement in Voice Conversion Sep 17, 2024 Decoder Disentanglement
— Unverified 0LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling Sep 13, 2024 CPU Rhythm
— Unverified 0HLTCOE JHU Submission to the Voice Privacy Challenge 2024 Sep 13, 2024 text-to-speech Text to Speech
— Unverified 0D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack Sep 11, 2024 Adversarial Attack Audio Synthesis
— Unverified 0VoiceWukong: Benchmarking Deepfake Voice Detection Sep 10, 2024 Benchmarking Face Swapping
— Unverified 0VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion Sep 10, 2024 Bandwidth Extension Voice Conversion
— Unverified 0Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder Sep 5, 2024 Disentanglement Voice Conversion
— Unverified 0ZSDEVC: Zero-Shot Diffusion-based Emotional Voice Conversion with Disentangled Mechanism Sep 5, 2024 Emotion Classification Voice Conversion
— Unverified 0vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders Sep 3, 2024 Speech Synthesis Voice Conversion
— Unverified 0FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation Sep 3, 2024 Voice Conversion
— Unverified 0USTC-KXDIGIT System Description for ASVspoof5 Challenge Sep 3, 2024 DeepFake Detection Face Swapping
— Unverified 0Pureformer-VC: Non-parallel One-Shot Voice Conversion with Pure Transformer Blocks and Triplet Discriminative Training Sep 3, 2024 Decoder Disentanglement
— Unverified 0Seeing Your Speech Style: A Novel Zero-Shot Identity-Disentanglement Face-based Voice Conversion Sep 1, 2024 Contrastive Learning Disentanglement
— Unverified 0Progressive Residual Extraction based Pre-training for Speech Representation Learning Aug 31, 2024 Emotion Recognition Representation Learning
— Unverified 0AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge Aug 30, 2024 DeepFake Detection Face Swapping
— Unverified 0EmoAttack: Utilizing Emotional Voice Conversion for Speech Backdoor Attacks on Deep Speech Classification Models Aug 28, 2024 Attribute Backdoor Attack
— Unverified 0MaskCycleGAN-based Whisper to Normal Speech Conversion Aug 27, 2024 Generative Adversarial Network Voice Conversion
— Unverified 0