Preserving background sound in noise-robust voice conversion via multi-task learning Nov 6, 2022 Multi-Task Learning Voice Conversion
— Unverified 0Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation Oct 31, 2022 Decoder Disentanglement
— Unverified 0Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection Oct 31, 2022 Audio Compression Face Swapping
— Unverified 0Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE Oct 25, 2022 Disentanglement Representation Learning
— Unverified 0Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion Oct 25, 2022 Attribute Voice Conversion
— Unverified 0MetaSpeech: Speech Effects Switch Along with Environment for Metaverse Oct 25, 2022 Voice Conversion
— Unverified 0Robust One-Shot Singing Voice Conversion Oct 20, 2022 Voice Conversion
— Unverified 0DisC-VC: Disentangled and F0-Controllable Neural Voice Conversion Oct 20, 2022 Voice Conversion
— Unverified 0Boosting Star-GANs for Voice Conversion with Contrastive Discriminator Sep 21, 2022 Contrastive Learning Voice Conversion
— Unverified 0Non-Parallel Voice Conversion for ASR Augmentation Sep 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset Sep 14, 2022 text-to-speech Text to Speech
— Unverified 0Investigation into Target Speaking Rate Adaptation for Voice Conversion Sep 5, 2022 Disentanglement Representation Learning
— Unverified 0Are disentangled representations all you need to build speaker anonymization systems? Aug 22, 2022 All Automatic Speech Recognition
— Unverified 0Differentiable WORLD Synthesizer-based Neural Vocoder With Application To End-To-End Audio Style Transfer Aug 15, 2022 Style Transfer Voice Conversion
— Unverified 0TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training Aug 8, 2022 Voice Conversion
— Unverified 0Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation Jul 29, 2022 Data Augmentation text-to-speech
— Unverified 0Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis Jul 25, 2022 Data Augmentation Speech Synthesis
— Unverified 0GlowVC: Mel-spectrogram space disentangling model for language-independent text-free voice conversion Jul 4, 2022 Voice Conversion
— Unverified 0A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion Jun 28, 2022 Speaker Recognition Voice Conversion
— Unverified 0Comparison of Speech Representations for the MOS Prediction System Jun 28, 2022 Self-Supervised Learning text-to-speech
— Unverified 0Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems Jun 18, 2022 Speaker Identification Speaker Verification
— Unverified 0End-to-End Voice Conversion with Information Perturbation Jun 15, 2022 Voice Conversion
— Unverified 0Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos Jun 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Investigating Inter- and Intra-speaker Voice Conversion using Audiobooks Jun 1, 2022 Speech Synthesis text-to-speech
— Unverified 0Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts May 10, 2022 Speech Synthesis Voice Conversion
Code Code Available 0Attentive activation function for improving end-to-end spoofing countermeasure systems May 3, 2022 Speech Synthesis Voice Conversion
— Unverified 0Multi-task learning improves synthetic speech detection Apr 27, 2022 Multi-Task Learning Speaker Verification
Code Code Available 0Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation Apr 21, 2022 Data Augmentation text-to-speech
— Unverified 0Audio Deep Fake Detection System with Neural Stitching for ADD 2022 Apr 19, 2022 text-to-speech Text to Speech
— Unverified 0Time Domain Adversarial Voice Conversion for ADD 2022 Apr 19, 2022 Voice Conversion
— Unverified 0The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance Apr 11, 2022 Speaker Verification Speech Synthesis
— Unverified 0Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification Apr 6, 2022 Attribute Speaker Verification
— Unverified 0Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective Apr 5, 2022 Disentanglement Representation Learning
— Unverified 0Self-Supervised Speech Representations Preserve Speech Characteristics while Anonymizing Voices Apr 4, 2022 Speaker Verification speech-recognition
— Unverified 0Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck Apr 4, 2022 Speaker Verification text-to-speech
— Unverified 0Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis Apr 1, 2022 Speech Synthesis Voice Conversion
Code Code Available 0WavThruVec: Latent speech representation as intermediate features for neural speech synthesis Mar 31, 2022 Speech Synthesis text-to-speech
— Unverified 0Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention VAE Mar 30, 2022 Decoder Sentence
— Unverified 0An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion Mar 29, 2022 Rhythm Voice Conversion
— Unverified 0Analysis of Voice Conversion and Code-Switching Synthesis Using VQ-VAE Mar 28, 2022 Speech Synthesis Voice Conversion
— Unverified 0A Speech Representation Anonymization Framework via Selective Noise Perturbation Mar 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Separating Content from Speaker Identity in Speech for the Assessment of Cognitive Impairments Mar 21, 2022 Speaker Verification Voice Conversion
— Unverified 0Improve few-shot voice cloning using multi-modal learning Mar 18, 2022 text-to-speech Text to Speech
— Unverified 0Text-free non-parallel many-to-many voice conversion using normalising flows Mar 15, 2022 Normalising Flows Speech Synthesis
— Unverified 0VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion Feb 18, 2022 Quantization Speech Synthesis
— Unverified 0Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module Feb 16, 2022 Speech Synthesis text-to-speech
— Unverified 0Partially Fake Audio Detection by Self-attention-based Fake Span Discovery Feb 14, 2022 Open-Ended Question Answering Question Answering
— Unverified 0