Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery May 4, 2021 Acoustic Unit Discovery Voice Conversion
— Unverified 0Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer Sep 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Voice Conversion Can Improve ASR in Very Low-Resource Settings Nov 4, 2021 Data Augmentation speech-recognition
— Unverified 0Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion Aug 28, 2020 Voice Conversion
— Unverified 0Voice Conversion for Stuttered Speech, Instruments, Unseen Languages and Textually Described Voices Oct 12, 2023 Voice Conversion
— Unverified 0Voice Conversion for Whispered Speech Synthesis Dec 11, 2019 Speech Synthesis Voice Conversion
— Unverified 0Voice Conversion Improves Cross-Domain Robustness for Spoken Arabic Dialect Identification May 30, 2025 Dialect Identification Voice Conversion
— Unverified 0Many-to-Many Voice Conversion using Cycle-Consistent Variational Autoencoder with Multiple Decoders Sep 15, 2019 Voice Conversion
— Unverified 0Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities Apr 10, 2017 speech-recognition Speech Recognition
— Unverified 0Voice Conversion with Conditional SampleRNN Aug 24, 2018 Voice Conversion
— Unverified 0Voice Conversion with Diverse Intonation using Conditional Variational Auto-Encoder Apr 16, 2025 Diversity Voice Conversion
— Unverified 0Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module Feb 16, 2022 Speech Synthesis text-to-speech
— Unverified 0VoiceMixer: Adversarial Voice Style Mixup Dec 1, 2021 Disentanglement Representation Learning
— Unverified 0VoicePrompter: Robust Zero-Shot Voice Conversion with Voice Prompt and Conditional Flow Matching Jan 29, 2025 Decoder In-Context Learning
— Unverified 0VoiceWukong: Benchmarking Deepfake Voice Detection Sep 10, 2024 Benchmarking Face Swapping
— Unverified 0Vowels and Prosody Contribution in Neural Network Based Voice Conversion Algorithm with Noisy Training Data Mar 10, 2020 Voice Conversion
— Unverified 0VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing Aug 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VSVC: Backdoor attack against Keyword Spotting based on Voiceprint Selection and Voice Conversion Dec 20, 2022 Backdoor Attack Keyword Spotting
— Unverified 0Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes Nov 29, 2023 Face Recognition Face Swapping
— Unverified 0WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks Sep 25, 2018 Speech Synthesis Voice Conversion
— Unverified 0WaveNet 聲碼器及其於語音轉換之應用 (WaveNet Vocoder and its Applications in Voice Conversion) [In Chinese] Oct 1, 2018 Voice Conversion
— Unverified 0WavThruVec: Latent speech representation as intermediate features for neural speech synthesis Mar 31, 2022 Speech Synthesis text-to-speech
— Unverified 0We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings Jul 5, 2024 Speaker Recognition Speech Synthesis
— Unverified 0When Automatic Voice Disguise Meets Automatic Speaker Verification Sep 15, 2020 Miscellaneous Speaker Verification
— Unverified 0When Humans Growl and Birds Speak: High-Fidelity Voice Conversion from Human to Animal and Designed Sounds May 30, 2025 Voice Conversion
— Unverified 0Who is Authentic Speaker Apr 30, 2024 Speaker Recognition Voice Conversion
— Unverified 0Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning Oct 27, 2021 Disentanglement Representation Learning
— Unverified 0Many-to-Many Voice Conversion with Out-of-Dataset Speaker Support Apr 30, 2019 Speaker Identification Voice Conversion
— Unverified 0StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching Dec 6, 2024 Voice Conversion
— Unverified 0StarGAN-based Emotional Voice Conversion for Japanese Phrases Apr 5, 2021 Voice Conversion
— Unverified 0StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition Aug 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0StarGAN-ZSVC: Towards Zero-Shot Voice Conversion in Low-Resource Contexts May 31, 2021 Voice Conversion
— Unverified 0StarVC: A Unified Auto-Regressive Framework for Joint Text and Speech Generation in Voice Conversion Jun 3, 2025 Voice Conversion
— Unverified 0Stepback: Enhanced Disentanglement for Voice Conversion via Multi-Task Learning Jan 26, 2025 Disentanglement Multi-Task Learning
— Unverified 0Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0StreamVC: Real-Time Low-Latency Voice Conversion Jan 5, 2024 Speech Synthesis Voice Conversion
— Unverified 0StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion Aug 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Jan 19, 2024 Language Modeling Language Modelling
— Unverified 0Stylebook: Content-Dependent Speaking Style Modeling for Any-to-Any Voice Conversion using Only Speech Data Sep 6, 2023 Decoder Self-Supervised Learning
— Unverified 0SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models Jun 12, 2024 Voice Conversion Voice Similarity
— Unverified 0Taco-VC: A Single Speaker Tacotron based Voice Conversion with Limited Data Apr 6, 2019 Phoneme Recognition Speech Enhancement
— Unverified 0Takin-VC: Expressive Zero-Shot Voice Conversion via Adaptive Hybrid Content Encoding and Enhanced Timbre Modeling Oct 2, 2024 Voice Conversion
— Unverified 0Text-free non-parallel many-to-many voice conversion using normalising flows Mar 15, 2022 Normalising Flows Speech Synthesis
— Unverified 0Textless NLP -- Zero Resource Challenge with Low Resource Compute Sep 24, 2024 Acoustic Unit Discovery GPU
— Unverified 0TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training Aug 8, 2022 Voice Conversion
— Unverified 0The Academia Sinica Systems of Voice Conversion for VCC2020 Oct 6, 2020 Task 2 Voice Conversion
— Unverified 0ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech Nov 5, 2019 Person Recognition Speaker Verification
— Unverified 0The Database and Benchmark for the Source Speaker Tracing Challenge 2024 Jun 7, 2024 Multi-Task Learning Speaker Verification
— Unverified 0The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition Jan 13, 2022 Generative Adversarial Network Phoneme Recognition
— Unverified 0The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices Dec 15, 2021 Speaker Identification Voice Conversion
— Unverified 0