Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems Jun 18, 2022 Speaker Identification Speaker Verification
— Unverified 0Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning Jun 15, 2022 Attribute Emotion Classification
Code Code Available 1End-to-End Voice Conversion with Information Perturbation Jun 15, 2022 Voice Conversion
— Unverified 0Speak Like a Dog: Human to Non-human creature Voice Conversion Jun 9, 2022 Generative Adversarial Network Voice Conversion
Code Code Available 1Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos Jun 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Investigating Inter- and Intra-speaker Voice Conversion using Audiobooks Jun 1, 2022 Speech Synthesis text-to-speech
— Unverified 0End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions May 19, 2022 Speech Synthesis Style Transfer
Code Code Available 1Towards Improved Zero-shot Voice Conversion with Conditional DSVAE May 11, 2022 Voice Conversion
Code Code Available 1Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts May 10, 2022 Speech Synthesis Voice Conversion
Code Code Available 0Attentive activation function for improving end-to-end spoofing countermeasure systems May 3, 2022 Speech Synthesis Voice Conversion
— Unverified 0Multi-task learning improves synthetic speech detection Apr 27, 2022 Multi-Task Learning Speaker Verification
Code Code Available 0Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation Apr 21, 2022 Data Augmentation text-to-speech
— Unverified 0Time Domain Adversarial Voice Conversion for ADD 2022 Apr 19, 2022 Voice Conversion
— Unverified 0Audio Deep Fake Detection System with Neural Stitching for ADD 2022 Apr 19, 2022 text-to-speech Text to Speech
— Unverified 0The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance Apr 11, 2022 Speaker Verification Speech Synthesis
— Unverified 0Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification Apr 6, 2022 Attribute Speaker Verification
— Unverified 0Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective Apr 5, 2022 Disentanglement Representation Learning
— Unverified 0Self-Supervised Speech Representations Preserve Speech Characteristics while Anonymizing Voices Apr 4, 2022 Speaker Verification speech-recognition
— Unverified 0Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck Apr 4, 2022 Speaker Verification text-to-speech
— Unverified 0Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis Apr 1, 2022 Speech Synthesis Voice Conversion
Code Code Available 0WavThruVec: Latent speech representation as intermediate features for neural speech synthesis Mar 31, 2022 Speech Synthesis text-to-speech
— Unverified 0HiFi-VC: High Quality ASR-Based Voice Conversion Mar 31, 2022 speech-recognition Speech Recognition
Code Code Available 1Efficient Non-Autoregressive GAN Voice Conversion using VQWav2vec Features and Dynamic Convolution Mar 31, 2022 Voice Conversion
Code Code Available 1Enhancing Zero-Shot Many to Many Voice Conversion with Self-Attention VAE Mar 30, 2022 Decoder Sentence
— Unverified 0Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice Conversion Mar 30, 2022 Data Augmentation Decoder
Code Code Available 1ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion Mar 29, 2022 Rhythm Voice Conversion
— Unverified 0Analysis of Voice Conversion and Code-Switching Synthesis Using VQ-VAE Mar 28, 2022 Speech Synthesis Voice Conversion
— Unverified 0A Speech Representation Anonymization Framework via Selective Noise Perturbation Mar 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0SpeechSplit 2.0: Unsupervised speech disentanglement for voice conversion Without tuning autoencoder Bottlenecks Mar 26, 2022 Disentanglement Rhythm
Code Code Available 1Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Separating Content from Speaker Identity in Speech for the Assessment of Cognitive Impairments Mar 21, 2022 Speaker Verification Voice Conversion
— Unverified 0Improve few-shot voice cloning using multi-modal learning Mar 18, 2022 text-to-speech Text to Speech
— Unverified 0Text-free non-parallel many-to-many voice conversion using normalising flows Mar 15, 2022 Normalising Flows Speech Synthesis
— Unverified 0iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform Mar 4, 2022 Speech Synthesis text-to-speech
Code Code Available 2Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph Feb 24, 2022 Decoder Quantization
Code Code Available 1VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion Feb 18, 2022 Quantization Speech Synthesis
— Unverified 0Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module Feb 16, 2022 Speech Synthesis text-to-speech
— Unverified 0Partially Fake Audio Detection by Self-attention-based Fake Span Discovery Feb 14, 2022 Open-Ended Question Answering Question Answering
— Unverified 0Cross-speaker style transfer for text-to-speech using data augmentation Feb 10, 2022 Data Augmentation Style Transfer
— Unverified 0Invertible Voice Conversion Jan 26, 2022 Voice Conversion
— Unverified 0The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition Jan 13, 2022 Generative Adversarial Network Phoneme Recognition
— Unverified 0A Practical Guide to Logical Access Voice Presentation Attack Detection Jan 10, 2022 Artifact Detection Speaker Verification
Code Code Available 0Emotion Intensity and its Control for Emotional Voice Conversion Jan 10, 2022 Emotion Classification Voice Conversion
— Unverified 0Adversarial Transformation of Spoofing Attacks for Voice Biometrics Jan 4, 2022 Speaker Verification Voice Conversion
— Unverified 0IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion Jan 2, 2022 Quantization Voice Conversion
— Unverified 0The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices Dec 15, 2021 Speaker Identification Voice Conversion
— Unverified 0Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features Dec 8, 2021 Decoder Self-Supervised Learning
— Unverified 0Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion Dec 6, 2021 Decoder Voice Conversion
— Unverified 0YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone Dec 4, 2021 Speech Synthesis Text-To-Speech Synthesis
Code Code Available 1