TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion Mar 16, 2023 Decoder Voice Conversion
Code Code Available 1Cross-modal Face- and Voice-style Transfer Feb 27, 2023 Diversity Image-to-Image Translation
— Unverified 0A Comparative Analysis Of Latent Regressor Losses For Singing Voice Conversion Feb 27, 2023 Contrastive Learning Disentanglement
— Unverified 0Catch You and I Can: Revealing Source Voiceprint Against Voice Conversion Feb 24, 2023 Representation Learning Speaker Verification
— Unverified 0Nonparallel Emotional Voice Conversion For Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing Feb 21, 2023 Voice Conversion
— Unverified 0ACE-VC: Adaptive and Controllable Voice Conversion using Explicitly Disentangled Self-supervised Speech Representations Feb 16, 2023 Self-Supervised Learning Speaker Verification
— Unverified 0Modelling low-resource accents without accent-specific TTS frontend Jan 11, 2023 text-to-speech Text to Speech
— Unverified 0UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion Jan 10, 2023 Quantization text-to-speech
— Unverified 0M4Singer: a Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus Dec 29, 2022 Music Transcription Singing Voice Synthesis
Code Code Available 2StyleTTS-VC: One-Shot Voice Conversion by Knowledge Transfer from Style-Based TTS Models Dec 29, 2022 Data Augmentation text-to-speech
Code Code Available 1VSVC: Backdoor attack against Keyword Spotting based on Voiceprint Selection and Voice Conversion Dec 20, 2022 Backdoor Attack Keyword Spotting
— Unverified 0Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units Dec 19, 2022 Rhythm Voice Conversion
Code Code Available 1Disentangling Prosody Representations with Unsupervised Speech Reconstruction Dec 14, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SpeechLMScore: Evaluating speech generation using speech language model Dec 8, 2022 Language Modeling Language Modelling
Code Code Available 1Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline Nov 29, 2022 Voice Conversion
Code Code Available 1Disentangled Feature Learning for Real-Time Neural Speech Coding Nov 22, 2022 Disentanglement Representation Learning
— Unverified 0Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning Nov 17, 2022 Binary Classification Meta-Learning
— Unverified 0Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints Nov 16, 2022 Voice Conversion
— Unverified 0Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder Nov 15, 2022 Contrastive Learning Disentanglement
— Unverified 0A unified one-shot prosody and speaker conversion system with self-supervised discrete speech units Nov 12, 2022 Rhythm Voice Conversion
Code Code Available 1Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features Nov 9, 2022 Decoder Voice Conversion
— Unverified 0Preserving background sound in noise-robust voice conversion via multi-task learning Nov 6, 2022 Multi-Task Learning Voice Conversion
— Unverified 0Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation Oct 31, 2022 Decoder Disentanglement
— Unverified 0Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection Oct 31, 2022 Audio Compression Face Swapping
— Unverified 0V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion Oct 27, 2022 Data Augmentation text annotation
Code Code Available 2Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion Oct 25, 2022 Attribute Voice Conversion
— Unverified 0Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE Oct 25, 2022 Disentanglement Representation Learning
— Unverified 0MetaSpeech: Speech Effects Switch Along with Environment for Metaverse Oct 25, 2022 Voice Conversion
— Unverified 0Robust One-Shot Singing Voice Conversion Oct 20, 2022 Voice Conversion
— Unverified 0DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion Oct 20, 2022 Voice Conversion
— Unverified 0GAN You Hear Me? Reclaiming Unconditional Speech Synthesis from Diffusion Models Oct 11, 2022 Disentanglement Generative Adversarial Network
Code Code Available 1Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental analysis of generalizability, open challenges, and the way forward Oct 2, 2022 Misinformation Speaker Verification
Code Code Available 1ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Speed Sep 23, 2022 Pitch control Speech Synthesis
Code Code Available 1Boosting Star-GANs for Voice Conversion with Contrastive Discriminator Sep 21, 2022 Contrastive Learning Voice Conversion
— Unverified 0Non-Parallel Voice Conversion for ASR Augmentation Sep 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset Sep 14, 2022 text-to-speech Text to Speech
— Unverified 0DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion Sep 9, 2022 De-identification Speaker Verification
Code Code Available 1Investigation into Target Speaking Rate Adaptation for Voice Conversion Sep 5, 2022 Disentanglement Representation Learning
— Unverified 0Are disentangled representations all you need to build speaker anonymization systems? Aug 22, 2022 All Automatic Speech Recognition
— Unverified 0Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion Aug 18, 2022 Disentanglement Rhythm
Code Code Available 1Differentiable WORLD Synthesizer-based Neural Vocoder With Application To End-To-End Audio Style Transfer Aug 15, 2022 Style Transfer Voice Conversion
— Unverified 0TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training Aug 8, 2022 Voice Conversion
— Unverified 0Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation Jul 29, 2022 Data Augmentation text-to-speech
— Unverified 0Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis Jul 25, 2022 Data Augmentation Speech Synthesis
— Unverified 0A Comparative Study of Self-supervised Speech Representation Based Voice Conversion Jul 10, 2022 Voice Conversion
Code Code Available 1GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion Jul 4, 2022 Voice Conversion
— Unverified 0A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion Jun 28, 2022 Speaker Recognition Voice Conversion
— Unverified 0Comparison of Speech Representations for the MOS Prediction System Jun 28, 2022 Self-Supervised Learning text-to-speech
— Unverified 0