Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices Oct 12, 2023 Voice Conversion
AutoCycle-VC: Towards Bottleneck-Independent Zero-Shot Cross-Lingual Voice Conversion Oct 10, 2023 Voice Conversion
A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023 Oct 8, 2023 Self-Supervised Learning Task 2
VITS-Based Singing Voice Conversion Leveraging Whisper and multi-scale F0 Modeling Oct 4, 2023 Decoder Voice Conversion
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion Sep 27, 2023 Decoder Knowledge Distillation
Towards General-Purpose Text-Instruction-Guided Voice Conversion Sep 25, 2023 Language Modeling Language Modelling
The Impact of Silence on Speech Anti-Spoofing Sep 21, 2023 Action Detection Activity Detection
Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment Sep 18, 2023 Voice Conversion
PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts Sep 17, 2023 Voice Conversion
Improving Voice Conversion for Dissimilar Speakers Using Perceptual Losses Sep 15, 2023 Speaker Verification Voice Conversion
Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech Sep 15, 2023 Knowledge Distillation Speech Synthesis
Parallel and Limited Data Voice Conversion Using Stochastic Variational Deep Kernel Learning Sep 8, 2023 Voice Conversion
Stylebook: Content-Dependent Speaking Style Modeling for Any-to-Any Voice Conversion using Only Speech Data Sep 6, 2023 Decoder Self-Supervised Learning
MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style Modeling Sep 3, 2023 Data Augmentation Disentanglement
Learning Speech Representation From Contrastive Token-Acoustic Pretraining Sep 1, 2023 Audio Classification Automatic Speech Recognition
Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations Aug 24, 2023 Representation Learning Speech Synthesis
Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion Aug 24, 2023 Audio Classification Binary Classification
Effects of Convolutional Autoencoder Bottleneck Width on StarGAN-based Singing Technique Conversion Aug 19, 2023 Voice Conversion
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs Jul 18, 2023 Generative Adversarial Network Language Modeling
Deep Learning-based F0 Synthesis for Speaker Anonymization Jun 29, 2023 Deep Learning Speaker anonymization
Fake the Real: Backdoor Attack on Deep Speech Classification via Voice Conversion Jun 28, 2023 Backdoor Attack Voice Conversion
Two-Stage Voice Anonymization for Enhanced Privacy Jun 28, 2023 Voice Conversion
Automatic Speech Disentanglement for Voice Conversion using Rank Module and Speech Augmentation Jun 21, 2023 Disentanglement Rhythm
LM-VC: Zero-shot Voice Conversion via Speech Generation based on Language Models Jun 18, 2023 Audio Generation Disentanglement
ALO-VC: Any-to-any Low-latency One-shot Voice Conversion Jun 1, 2023 CPU Voice Conversion
Make-A-Voice: Unified Voice Synthesis With Discrete Representation May 30, 2023 Singing Voice Synthesis text-to-speech
Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models May 27, 2023 Speech Synthesis Voice Conversion
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion May 25, 2023 Audio Deepfake Detection DeepFake Detection
Iteratively Improving Speech Recognition and Voice Conversion May 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding May 21, 2023 Data Augmentation Decoder
Data Augmentation for Diverse Voice Conversion in Noisy Environments May 18, 2023 Data Augmentation Decoder
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion May 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Multi-level Temporal-channel Speaker Retrieval for Zero-shot Voice Conversion May 12, 2023 Disentanglement Retrieval
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment May 8, 2023 cross-modal alignment Rhythm
Evaluation of Speaker Anonymization on Emotional Speech Apr 15, 2023 Automatic Speech Recognition Emotion Recognition
Self-Supervised Representations for Singing Voice Conversion Mar 21, 2023 Disentanglement Voice Conversion
A Comparative Analysis Of Latent Regressor Losses For Singing Voice Conversion Feb 27, 2023 Contrastive Learning Disentanglement
Cross-modal Face- and Voice-style Transfer Feb 27, 2023 Diversity Image-to-Image Translation
Catch You and I Can: Revealing Source Voiceprint Against Voice Conversion Feb 24, 2023 Representation Learning Speaker Verification
Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing Feb 21, 2023 Voice Conversion
ACE-VC: Adaptive and Controllable Voice Conversion using Explicitly Disentangled Self-supervised Speech Representations Feb 16, 2023 Self-Supervised Learning Speaker Verification
Modelling low-resource accents without accent-specific TTS frontend Jan 11, 2023 text-to-speech Text to Speech
UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion Jan 10, 2023 Quantization text-to-speech
VSVC: Backdoor attack against Keyword Spotting based on Voiceprint Selection and Voice Conversion Dec 20, 2022 Backdoor Attack Keyword Spotting
Disentangling Prosody Representations with Unsupervised Speech Reconstruction Dec 14, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Disentangled Feature Learning for Real-Time Neural Speech Coding Nov 22, 2022 Disentanglement Representation Learning
Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning Nov 17, 2022 Binary Classification Meta-Learning
Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints Nov 16, 2022 Voice Conversion
Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder Nov 15, 2022 Contrastive Learning Disentanglement
Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features Nov 9, 2022 Decoder Voice Conversion