Transfer the linguistic representations from TTS to accent conversion with non-parallel data Jan 7, 2024 text-to-speech Text to Speech
— Unverified 00 Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis Jul 25, 2022 Data Augmentation Speech Synthesis
— Unverified 00 Two-Stage Voice Anonymization for Enhanced Privacy Jun 28, 2023 Voice Conversion
— Unverified 00 UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion Jan 10, 2023 Quantization text-to-speech
— Unverified 00 Unsupervised Cross-Domain Singing Voice Conversion Aug 6, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR Jan 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Unsupervised Singing Voice Conversion Apr 13, 2019 Data Augmentation Decoder
— Unverified 00 Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset Sep 14, 2022 text-to-speech Text to Speech
— Unverified 00 USTC-KXDIGIT System Description for ASVspoof5 Challenge Sep 3, 2024 DeepFake Detection Face Swapping
— Unverified 00 V2S attack: building DNN-based voice conversion from automatic speaker verification Aug 5, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech Nov 3, 2020 Decoder Disentanglement
— Unverified 00 VAW-GAN for Singing Voice Conversion with Non-parallel Training Data Aug 10, 2020 Decoder Generative Adversarial Network
— Unverified 00 VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion Sep 10, 2024 Bandwidth Extension Voice Conversion
— Unverified 00 V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion Feb 18, 2022 Quantization Speech Synthesis
— Unverified 00 vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders Sep 3, 2024 Speech Synthesis Voice Conversion
— Unverified 00 Versatile Speech Databases for High Quality Synthesis for Basque May 1, 2012 Emotional Speech Synthesis Speech Synthesis
— Unverified 00 Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement Feb 11, 2025 Disentanglement text-to-speech
— Unverified 00 VibE-SVC: Vibrato Extraction with High-frequency F0 Contour for Singing Voice Conversion May 27, 2025 Voice Conversion
— Unverified 00 VITS-Based Singing Voice Conversion Leveraging Whisper and multi-scale F0 Modeling Oct 4, 2023 Decoder Voice Conversion
— Unverified 00 Voice Conversion Augmentation for Speaker Recognition on Defective Datasets Apr 1, 2024 Speaker Recognition Voice Conversion
— Unverified 00 Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery May 4, 2021 Acoustic Unit Discovery Voice Conversion
— Unverified 00 Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer Sep 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Voice Conversion Can Improve ASR in Very Low-Resource Settings Nov 4, 2021 Data Augmentation speech-recognition
— Unverified 00 Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion Aug 28, 2020 Voice Conversion
— Unverified 00 Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices Oct 12, 2023 Voice Conversion
— Unverified 00 Voice Conversion for Whispered Speech Synthesis Dec 11, 2019 Speech Synthesis Voice Conversion
— Unverified 00 Voice Conversion Improves Cross-Domain Robustness for Spoken Arabic Dialect Identification May 30, 2025 Dialect Identification Voice Conversion
— Unverified 00 Many-to-Many Voice Conversion using Cycle-Consistent Variational Autoencoder with Multiple Decoders Sep 15, 2019 Voice Conversion
— Unverified 00 Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities Apr 10, 2017 speech-recognition Speech Recognition
— Unverified 00 Voice Conversion with Conditional SampleRNN Aug 24, 2018 Voice Conversion
— Unverified 00 Voice Conversion with Diverse Intonation using Conditional Variational Auto-Encoder Apr 16, 2025 Diversity Voice Conversion
— Unverified 00 Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module Feb 16, 2022 Speech Synthesis text-to-speech
— Unverified 00 VoiceMixer: Adversarial Voice Style Mixup Dec 1, 2021 Disentanglement Representation Learning
— Unverified 00 VoicePrompter: Robust Zero-Shot Voice Conversion with Voice Prompt and Conditional Flow Matching Jan 29, 2025 Decoder In-Context Learning
— Unverified 00 VoiceWukong: Benchmarking Deepfake Voice Detection Sep 10, 2024 Benchmarking Face Swapping
— Unverified 00 Vowels and Prosody Contribution in Neural Network Based Voice Conversion Algorithm with Noisy Training Data Mar 10, 2020 Voice Conversion
— Unverified 00 VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing Aug 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 VSVC: Backdoor attack against Keyword Spotting based on Voiceprint Selection and Voice Conversion Dec 20, 2022 Backdoor Attack Keyword Spotting
— Unverified 00 Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes Nov 29, 2023 Face Recognition Face Swapping
— Unverified 00 WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks Sep 25, 2018 Speech Synthesis Voice Conversion
— Unverified 00 WaveNet 聲碼器及其於語音轉換之應用 (WaveNet Vocoder and its Applications in Voice Conversion) [In Chinese] Oct 1, 2018 Voice Conversion
— Unverified 00 WavThruVec: Latent speech representation as intermediate features for neural speech synthesis Mar 31, 2022 Speech Synthesis text-to-speech
— Unverified 00 We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings Jul 5, 2024 Speaker Recognition Speech Synthesis
— Unverified 00 When Automatic Voice Disguise Meets Automatic Speaker Verification Sep 15, 2020 Miscellaneous Speaker Verification
— Unverified 00 When Humans Growl and Birds Speak: High-Fidelity Voice Conversion from Human to Animal and Designed Sounds May 30, 2025 Voice Conversion
— Unverified 00 Who is Authentic Speaker Apr 30, 2024 Speaker Recognition Voice Conversion
— Unverified 00 Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning Oct 27, 2021 Disentanglement Representation Learning
— Unverified 00 ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training Jan 8, 2025 In-Context Learning Voice Conversion
— Unverified 00 Many-to-Many Voice Conversion based Feature Disentanglement using Variational Autoencoder Jul 11, 2021 Disentanglement Voice Conversion
— Unverified 00