Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis Feb 28, 2020 Speech Synthesis text-to-speech
Code Code Available 05 A Practical Guide to Logical Access Voice Presentation Attack Detection Jan 10, 2022 Artifact Detection Speaker Verification
Code Code Available 05 Spoof detection using time-delay shallow neural network and feature switching Apr 16, 2019 Speaker Verification Speech Synthesis
Code Code Available 05 Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals Jan 19, 2018 Speech Synthesis Voice Conversion
Code Code Available 05 Scalable Factorized Hierarchical Variational Autoencoder Training Apr 9, 2018 Disentanglement Hyperparameter Optimization
Code Code Available 05 SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines Nov 6, 2021 Disentanglement Speaker Verification
Code Code Available 05 Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis Apr 1, 2022 Speech Synthesis Voice Conversion
Code Code Available 05 Emotional Voice Conversion using Multitask Learning with Text-to-speech Nov 11, 2019 Decoder text-to-speech
Code Code Available 05 Playing with Voices: Tabletop Role-Playing Game Recordings as a Diarization Challenge Feb 18, 2025 Voice Conversion
Code Code Available 05 Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation Dec 8, 2020 Attribute Disentanglement
Code Code Available 05 Parallel-Data-Free Voice Conversion Using Cycle-Consistent Adversarial Networks Nov 30, 2017 Voice Conversion
Code Code Available 05 Private kNN-VC: Interpretable Anonymization of Converted Speech May 23, 2025 Speaker anonymization Speaker Recognition
Code Code Available 05 Non-Parallel Training Approach for Emotional Voice Conversion Using CycleGAN Nov 1, 2023 Voice Conversion
Code Code Available 05 NVC-Net: End-to-End Adversarial Voice Conversion Jun 2, 2021 GPU Speech Synthesis
Code Code Available 05 Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations Apr 9, 2018 Decoder Voice Conversion
Code Code Available 05 Mel-spectrogram augmentation for sequence to sequence voice conversion Jan 6, 2020 Voice Conversion
Code Code Available 05 Multi-task learning improves synthetic speech detection Apr 27, 2022 Multi-Task Learning Speaker Verification
Code Code Available 05 Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts May 10, 2022 Speech Synthesis Voice Conversion
Code Code Available 05 Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion May 25, 2023 Audio Deepfake Detection DeepFake Detection
Code Code Available 05 Investigating on Incorporating Pretrained and Learnable Speaker Representations for Multi-Speaker Multi-Style Text-to-Speech Mar 6, 2021 text-to-speech Text to Speech
Code Code Available 05 Anonymising Elderly and Pathological Speech: Voice Conversion Using DDSP and Query-by-Example Oct 20, 2024 Voice Conversion
Code Code Available 05 Hear Your Face: Face-based voice conversion with F0 estimation Aug 19, 2024 Voice Conversion
Code Code Available 05 Improving Zero-shot Voice Style Transfer via Disentangled Representation Learning Mar 17, 2021 Decoder Representation Learning
Code Code Available 05 MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms Oct 8, 2019 Generative Adversarial Network Music Style Transfer
Code Code Available 05 Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion May 28, 2019 Decoder Voice Conversion
Code Code Available 05 Delivering Speaking Style in Low-resource Voice Conversion with Multi-factor Constraints Nov 16, 2022 Voice Conversion
— Unverified 00 A Unified Model For Voice and Accent Conversion In Speech and Singing using Self-Supervised Learning and Feature Extraction Dec 11, 2024 Decoder Self-Supervised Learning
— Unverified 00 DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices Aug 15, 2020 Speaker Recognition Voice Conversion
— Unverified 00 Audio Deep Fake Detection System with Neural Stitching for ADD 2022 Apr 19, 2022 text-to-speech Text to Speech
— Unverified 00 Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling Aug 9, 2020 Deep Learning Speech Synthesis
— Unverified 00 Deep Learning-based F0 Synthesis for Speaker Anonymization Jun 29, 2023 Deep Learning Speaker anonymization
— Unverified 00 Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning Nov 17, 2022 Binary Classification Meta-Learning
— Unverified 00 An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR Mar 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Adaptive Speech Duration Modification using a Deep-Generative Framework Sep 29, 2021 Decoder Dynamic Time Warping
— Unverified 00 DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding Oct 13, 2021 Speech Synthesis Voice Conversion
— Unverified 00 AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms Nov 9, 2018 GPU Image Captioning
— Unverified 00 Attentive activation function for improving end-to-end spoofing countermeasure systems May 3, 2022 Speech Synthesis Voice Conversion
— Unverified 00 Analysis of Voice Conversion and Code-Switching Synthesis Using VQ-VAE Mar 28, 2022 Speech Synthesis Voice Conversion
— Unverified 00 D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack Sep 11, 2024 Adversarial Attack Audio Synthesis
— Unverified 00 Data Augmentation for Diverse Voice Conversion in Noisy Environments May 18, 2023 Data Augmentation Decoder
— Unverified 00 Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion Dec 29, 2023 Contrastive Learning Disentanglement
— Unverified 00 ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech Feb 13, 2025 Adversarial Attack Adversarial Attack Detection
— Unverified 00 An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion Apr 25, 2021 Generative Adversarial Network Speech Synthesis
— Unverified 00 ACE-VC: Adaptive and Controllable Voice Conversion using Explicitly Disentangled Self-supervised Speech Representations Feb 16, 2023 Self-Supervised Learning Speaker Verification
— Unverified 00 CycleFlow: Purify Information Factors by Cycle Loss Oct 18, 2021 Voice Conversion
— Unverified 00 ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech Feb 11, 2021 Speaker Verification Speech Synthesis
— Unverified 00 Custom Data Augmentation for low resource ASR using Bark and Retrieval-Based Voice Conversion Nov 24, 2023 Data Augmentation Retrieval
— Unverified 00 CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching Nov 4, 2024 Speaker Verification Voice Conversion
— Unverified 00 ALO-VC: Any-to-any Low-latency One-shot Voice Conversion Jun 1, 2023 CPU Voice Conversion
— Unverified 00 Cross-speaker style transfer for text-to-speech using data augmentation Feb 10, 2022 Data Augmentation Style Transfer
— Unverified 00