SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition Jan 31, 2024 Decoder Language Modeling
— Unverified 0Speech Enhancement-assisted Voice Conversion in Noisy Environments Oct 19, 2021 Speech Enhancement Voice Conversion
— Unverified 0Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives Jan 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training Jan 8, 2025 In-Context Learning Voice Conversion
— Unverified 0Automatic Voice Identification after Speech Resynthesis using PPG Aug 5, 2024 Resynthesis Speaker Verification
— Unverified 0RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding Jun 12, 2025 CPU Voice Conversion
— Unverified 0AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge Aug 30, 2024 DeepFake Detection Face Swapping
— Unverified 0ACE-VC: Adaptive and Controllable Voice Conversion using Explicitly Disentangled Self-supervised Speech Representations Feb 16, 2023 Self-Supervised Learning Speaker Verification
— Unverified 0A Comparative Analysis Of Latent Regressor Losses For Singing Voice Conversion Feb 27, 2023 Contrastive Learning Disentanglement
— Unverified 0A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023 Oct 8, 2023 Self-Supervised Learning Task 2
— Unverified 0AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion Nov 12, 2021 Voice Conversion
— Unverified 0Adaptive Speech Duration Modification using a Deep-Generative Framework Sep 29, 2021 Decoder Dynamic Time Warping
— Unverified 0AdaptVC: High Quality Voice Conversion with Adaptive Learning Jan 2, 2025 Decoder Disentanglement
— Unverified 0A Deep-Bayesian Framework for Adaptive Speech Duration Modification Jul 11, 2021 Decoder Dynamic Time Warping
— Unverified 0Adversarially learning disentangled speech representations for robust multi-factor voice conversion Jan 30, 2021 Representation Learning Rhythm
— Unverified 0Adversarially Trained Autoencoders for Parallel-Data-Free Voice Conversion May 9, 2019 Voice Conversion
— Unverified 0Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion May 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adversarial speech for voice privacy protection from Personalized Speech generation Jan 22, 2024 Speaker Verification text-to-speech
— Unverified 0Adversarial Transformation of Spoofing Attacks for Voice Biometrics Jan 4, 2022 Speaker Verification Voice Conversion
— Unverified 0AE-Flow: AutoEncoder Normalizing Flow Dec 27, 2023 text-to-speech Text to Speech
— Unverified 0A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion Jun 28, 2022 Speaker Recognition Voice Conversion
— Unverified 0AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment May 8, 2023 cross-modal alignment Rhythm
— Unverified 0ALO-VC: Any-to-any Low-latency One-shot Voice Conversion Jun 1, 2023 CPU Voice Conversion
— Unverified 0An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion Apr 25, 2021 Generative Adversarial Network Speech Synthesis
— Unverified 0Analysis of Voice Conversion and Code-Switching Synthesis Using VQ-VAE Mar 28, 2022 Speech Synthesis Voice Conversion
— Unverified 0An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR Mar 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An Objective Evaluation Framework for Pathological Speech Synthesis Jul 1, 2021 Speech Synthesis Voice Conversion
— Unverified 0An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion Mar 29, 2022 Rhythm Voice Conversion
— Unverified 0An overview of text-to-speech systems and media applications Oct 22, 2023 Acoustic Modelling text-to-speech
— Unverified 0Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck Apr 4, 2022 Speaker Verification text-to-speech
— Unverified 0Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations Oct 23, 2020 Voice Conversion
— Unverified 0A Perception-Based L2 Speech Intelligibility Indicator: Leveraging a Rater's Shadowing and Sequence-to-sequence Voice Conversion May 30, 2025 Voice Conversion
— Unverified 0A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings Oct 3, 2024 speech-recognition Speech Recognition
— Unverified 0Application of ASV for Voice Identification after VC and Duration Predictor Improvement in TTS Models Jun 27, 2024 Speaker Verification text-to-speech
— Unverified 0A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion Jun 2, 2021 Voice Conversion
— Unverified 0SongBsAb: A Dual Prevention Approach against Singing Voice Conversion based Illegal Song Covers Jan 30, 2024 Voice Conversion
— Unverified 0Are disentangled representations all you need to build speaker anonymization systems? Aug 22, 2022 All Automatic Speech Recognition
— Unverified 0A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech May 8, 2018 Language Identification Speech Synthesis
— Unverified 0ArVoice: A Multi-Speaker Dataset for Arabic Speech Synthesis May 26, 2025 DeepFake Detection Face Swapping
— Unverified 0A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment Apr 23, 2018 Benchmarking Speaker Verification
— Unverified 0ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech Feb 11, 2021 Speaker Verification Speech Synthesis
— Unverified 0ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech Feb 13, 2025 Adversarial Attack Adversarial Attack Detection
— Unverified 0Attention-based Interactive Disentangling Network for Instance-level Emotional Voice Conversion Dec 29, 2023 Contrastive Learning Disentanglement
— Unverified 0Attentive activation function for improving end-to-end spoofing countermeasure systems May 3, 2022 Speech Synthesis Voice Conversion
— Unverified 0AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms Nov 9, 2018 GPU Image Captioning
— Unverified 0Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning Nov 17, 2022 Binary Classification Meta-Learning
— Unverified 0Audio Deep Fake Detection System with Neural Stitching for ADD 2022 Apr 19, 2022 text-to-speech Text to Speech
— Unverified 0A Unified Model For Voice and Accent Conversion In Speech and Singing using Self-Supervised Learning and Feature Extraction Dec 11, 2024 Decoder Self-Supervised Learning
— Unverified 0AutoCycle-VC: Towards Bottleneck-Independent Zero-Shot Cross-Lingual Voice Conversion Oct 10, 2023 Voice Conversion
— Unverified 0Automatic Speech Disentanglement for Voice Conversion using Rank Module and Speech Augmentation Jun 21, 2023 Disentanglement Rhythm
— Unverified 0