AutoCycle-VC: Towards Bottleneck-Independent Zero-Shot Cross-Lingual Voice Conversion Oct 10, 2023 Voice Conversion
— Unverified 0A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023 Oct 8, 2023 Self-Supervised Learning Task 2
— Unverified 0VITS-Based Singing Voice Conversion Leveraging Whisper and multi-scale F0 Modeling Oct 4, 2023 Decoder Voice Conversion
— Unverified 0DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion Sep 27, 2023 Decoder Knowledge Distillation
— Unverified 0Towards General-Purpose Text-Instruction-Guided Voice Conversion Sep 25, 2023 Language Modeling Language Modelling
— Unverified 0BiSinger: Bilingual Singing Voice Synthesis Sep 25, 2023 Singing Voice Synthesis text-to-speech
Code Code Available 1The Impact of Silence on Speech Anti-Spoofing Sep 21, 2023 Action Detection Activity Detection
— Unverified 0Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment Sep 18, 2023 Voice Conversion
— Unverified 0PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts Sep 17, 2023 Voice Conversion
— Unverified 0Improving Voice Conversion for Dissimilar Speakers Using Perceptual Losses Sep 15, 2023 Speaker Verification Voice Conversion
— Unverified 0HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methods Sep 15, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 1Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech Sep 15, 2023 Knowledge Distillation Speech Synthesis
— Unverified 0StarGAN-VC++: Towards Emotion Preserving Voice Conversion Using Deep Embeddings Sep 14, 2023 Generative Adversarial Network Voice Conversion
Code Code Available 1Emo-StarGAN: A Semi-Supervised Any-to-Many Non-Parallel Emotion-Preserving Voice Conversion Sep 14, 2023 Voice Conversion
Code Code Available 1Parallel and Limited Data Voice Conversion Using Stochastic Variational Deep Kernel Learning Sep 8, 2023 Voice Conversion
— Unverified 0Stylebook: Content-Dependent Speaking Style Modeling for Any-to-Any Voice Conversion using Only Speech Data Sep 6, 2023 Decoder Self-Supervised Learning
— Unverified 0Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion Sep 5, 2023 Voice Conversion
Code Code Available 1FSD: An Initial Chinese Dataset for Fake Song Detection Sep 5, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 1MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style Modeling Sep 3, 2023 Data Augmentation Disentanglement
— Unverified 0Learning Speech Representation From Contrastive Token-Acoustic Pretraining Sep 1, 2023 Audio Classification Automatic Speech Recognition
— Unverified 0Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion Aug 24, 2023 Audio Classification Binary Classification
— Unverified 0Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations Aug 24, 2023 Representation Learning Speech Synthesis
— Unverified 0Effects of Convolutional Autoencoder Bottleneck Width on StarGAN-based Singing Technique Conversion Aug 19, 2023 Voice Conversion
— Unverified 0Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion Aug 11, 2023 Voice Conversion
Code Code Available 2Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques Aug 5, 2023 Quantization Speaker anonymization
Code Code Available 1SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs Jul 18, 2023 Generative Adversarial Network Language Modeling
— Unverified 0Rhythm Modeling for Voice Conversion Jul 12, 2023 Rhythm Voice Conversion
Code Code Available 1Disentanglement in a GAN for Unconditional Speech Synthesis Jul 4, 2023 Disentanglement Generative Adversarial Network
Code Code Available 1Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion Jul 1, 2023 speech-recognition Speech Recognition
Code Code Available 1Deep Learning-based F0 Synthesis for Speaker Anonymization Jun 29, 2023 Deep Learning Speaker anonymization
— Unverified 0Fake the Real: Backdoor Attack on Deep Speech Classification via Voice Conversion Jun 28, 2023 Backdoor Attack Voice Conversion
— Unverified 0Two-Stage Voice Anonymization for Enhanced Privacy Jun 28, 2023 Voice Conversion
— Unverified 0The Singing Voice Conversion Challenge 2023 Jun 26, 2023 Voice Conversion
Code Code Available 1Automatic Speech Disentanglement for Voice Conversion using Rank Module and Speech Augmentation Jun 21, 2023 Disentanglement Rhythm
— Unverified 0LM-VC: Zero-shot Voice Conversion via Speech Generation based on Language Models Jun 18, 2023 Audio Generation Disentanglement
— Unverified 0DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model Jun 18, 2023 Data Augmentation Decoder
Code Code Available 1ALO-VC: Any-to-any Low-latency One-shot Voice Conversion Jun 1, 2023 CPU Voice Conversion
— Unverified 0Voice Conversion With Just Nearest Neighbors May 30, 2023 Voice Conversion
Code Code Available 2Make-A-Voice: Unified Voice Synthesis With Discrete Representation May 30, 2023 Singing Voice Synthesis text-to-speech
— Unverified 0Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models May 27, 2023 Speech Synthesis Voice Conversion
— Unverified 0Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion May 25, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 0DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion May 25, 2023 Denoising Style Transfer
Code Code Available 2Iteratively Improving Speech Recognition and Voice Conversion May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding May 21, 2023 Data Augmentation Decoder
— Unverified 0Data Augmentation for Diverse Voice Conversion in Noisy Environments May 18, 2023 Data Augmentation Decoder
— Unverified 0Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion May 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-level Temporal-channel Speaker Retrieval for Zero-shot Voice Conversion May 12, 2023 Disentanglement Retrieval
— Unverified 0AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment May 8, 2023 cross-modal alignment Rhythm
— Unverified 0Evaluation of Speaker Anonymization on Emotional Speech Apr 15, 2023 Automatic Speech Recognition Emotion Recognition
— Unverified 0Self-Supervised Representations for Singing Voice Conversion Mar 21, 2023 Disentanglement Voice Conversion
— Unverified 0