Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder Sep 5, 2024 Disentanglement Voice Conversion
— Unverified 00 Speaker-independent raw waveform model for glottal excitation Apr 25, 2018 model Speech Synthesis
— Unverified 00 Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN Aug 11, 2020 Voice Conversion
— Unverified 00 SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition Jan 31, 2024 Decoder Language Modeling
— Unverified 00 Speech Enhancement-assisted Voice Conversion in Noisy Environments Oct 19, 2021 Speech Enhancement Voice Conversion
— Unverified 00 Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives Jan 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Speech Synthesis along Perceptual Voice Quality Dimensions Jan 15, 2025 Expressive Speech Synthesis Speech Synthesis
— Unverified 00 StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching Dec 6, 2024 Voice Conversion
— Unverified 00 StarGAN-based Emotional Voice Conversion for Japanese Phrases Apr 5, 2021 Voice Conversion
— Unverified 00 StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition Aug 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 StarGAN-ZSVC: Towards Zero-Shot Voice Conversion in Low-Resource Contexts May 31, 2021 Voice Conversion
— Unverified 00 StarVC: A Unified Auto-Regressive Framework for Joint Text and Speech Generation in Voice Conversion Jun 3, 2025 Voice Conversion
— Unverified 00 Stepback: Enhanced Disentanglement for Voice Conversion via Multi-Task Learning Jan 26, 2025 Disentanglement Multi-Task Learning
— Unverified 00 Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 StreamVC: Real-Time Low-Latency Voice Conversion Jan 5, 2024 Speech Synthesis Voice Conversion
— Unverified 00 StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion Aug 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion Jan 19, 2024 Language Modeling Language Modelling
— Unverified 00 Stylebook: Content-Dependent Speaking Style Modeling for Any-to-Any Voice Conversion using Only Speech Data Sep 6, 2023 Decoder Self-Supervised Learning
— Unverified 00 SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models Jun 12, 2024 Voice Conversion Voice Similarity
— Unverified 00 Taco-VC: A Single Speaker Tacotron based Voice Conversion with Limited Data Apr 6, 2019 Phoneme Recognition Speech Enhancement
— Unverified 00 Takin-VC: Expressive Zero-Shot Voice Conversion via Adaptive Hybrid Content Encoding and Enhanced Timbre Modeling Oct 2, 2024 Voice Conversion
— Unverified 00 Text-free non-parallel many-to-many voice conversion using normalising flows Mar 15, 2022 Normalising Flows Speech Synthesis
— Unverified 00 Textless NLP -- Zero Resource Challenge with Low Resource Compute Sep 24, 2024 Acoustic Unit Discovery GPU
— Unverified 00 TGAVC: Improving Autoencoder Voice Conversion with Text-Guided and Adversarial Training Aug 8, 2022 Voice Conversion
— Unverified 00 The Academia Sinica Systems of Voice Conversion for VCC2020 Oct 6, 2020 Task 2 Voice Conversion
— Unverified 00 ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech Nov 5, 2019 Person Recognition Speaker Verification
— Unverified 00 The Database and Benchmark for the Source Speaker Tracing Challenge 2024 Jun 7, 2024 Multi-Task Learning Speaker Verification
— Unverified 00 The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition Jan 13, 2022 Generative Adversarial Network Phoneme Recognition
— Unverified 00 The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices Dec 15, 2021 Speaker Identification Voice Conversion
— Unverified 00 The Impact of Silence on Speech Anti-Spoofing Sep 21, 2023 Action Detection Activity Detection
— Unverified 00 The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders Oct 9, 2020 Task 2 Voice Conversion
— Unverified 00 The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance Apr 11, 2022 Speaker Verification Speech Synthesis
— Unverified 00 The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge Jul 24, 2015 Speaker Verification Speech Synthesis
— Unverified 00 The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods Apr 12, 2018 Voice Conversion
— Unverified 00 The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation Jul 16, 2024 Automatic Speech Recognition speech-recognition
— Unverified 00 Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion Sep 8, 2021 Dynamic Time Warping Speech Enhancement
— Unverified 00 Time Domain Adversarial Voice Conversion for ADD 2022 Apr 19, 2022 Voice Conversion
— Unverified 00 Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples Aug 23, 2024 Data Augmentation Meta-Learning
— Unverified 00 Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion Jun 4, 2025 Disentanglement Style Transfer
— Unverified 00 Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels Apr 15, 2021 Voice Conversion
— Unverified 00 Towards General-Purpose Text-Instruction-Guided Voice Conversion Sep 25, 2023 Language Modeling Language Modelling
— Unverified 00 Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding Oct 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Towards Identity Preserving Normal to Dysarthric Voice Conversion Oct 15, 2021 Data Augmentation Decision Making
— Unverified 00 Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram Feb 3, 2021 text-to-speech Text to Speech
— Unverified 00 Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion Oct 16, 2020 Speech Synthesis text-to-speech
— Unverified 00 Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline Jun 6, 2024 Voice Conversion
— Unverified 00 Towards Realistic Emotional Voice Conversion using Controllable Emotional Intensity Jul 20, 2024 Diversity Rhythm
— Unverified 00 Towards Robust Neural Vocoding for Speech Generation: A Survey Dec 5, 2019 Speech Synthesis Survey
— Unverified 00 Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features Dec 8, 2021 Decoder Self-Supervised Learning
— Unverified 00 Transcription and translation of videos using fine-tuned XLSR Wav2Vec2 on custom dataset and mBART Mar 1, 2024 Retrieval Translation
— Unverified 00