| To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition | Dec 9, 2018 | Multi-Task LearningSpeaker Recognition | —Unverified | 0 | 0 |
| Towards End-to-End Private Automatic Speaker Recognition | Jun 23, 2022 | Privacy PreservingSpeaker Recognition | —Unverified | 0 | 0 |
| Late Audio-Visual Fusion for In-The-Wild Speaker Diarization | Nov 2, 2022 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| Towards Relevance and Sequence Modeling in Language Recognition | Apr 2, 2020 | Language IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| Transforming the Embeddings: A Lightweight Technique for Speech Emotion Recognition Tasks | May 29, 2023 | Emotion RecognitionSpeaker Recognition | —Unverified | 0 | 0 |
| Triplet Based Embedding Distance and Similarity Learning for Text-independent Speaker Verification | Aug 6, 2019 | Speaker RecognitionSpeaker Verification | —Unverified | 0 | 0 |
| Understanding Contrastive Learning Through the Lens of Margins | Jun 20, 2023 | Contrastive LearningRepresentation Learning | —Unverified | 0 | 0 |
| UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023 | Aug 24, 2023 | Speaker Recognition | —Unverified | 0 | 0 |
| Universal speaker recognition encoders for different speech segments duration | Oct 28, 2022 | Speaker RecognitionSpeaker Verification | —Unverified | 0 | 0 |
| UniX-Encoder: A Universal X-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing | Oct 25, 2023 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| Unsupervised Adaptation of SPLDA | Nov 20, 2015 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| Unsupervised Learning of Disentangled Speech Content and Style Representation | Oct 24, 2020 | DecoderSpeaker Recognition | —Unverified | 0 | 0 |
| 以二維共振峰分布建立語者音色模型及其在語者驗證上之應用 (Using 2D Formant Distribution to Build Speaker Models and Its Application in Speaker Verification) [In Chinese] | Oct 1, 2014 | Speaker RecognitionSpeaker Verification | —Unverified | 0 | 0 |
| UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation | Oct 24, 2016 | ClusteringDimensionality Reduction | —Unverified | 0 | 0 |
| Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework | May 25, 2021 | Speaker Recognition | —Unverified | 0 | 0 |
| VAE-based regularization for deep speaker embedding | Apr 7, 2019 | Speaker Recognition | —Unverified | 0 | 0 |
| Variational Autoencoders with implicit priors for short-duration text-independent speaker verification | Oct 22, 2018 | Speaker RecognitionSpeaker Verification | —Unverified | 0 | 0 |
| Visual Speech Recognition | Sep 3, 2014 | Audio-Visual Speech RecognitionLip Reading | —Unverified | 0 | 0 |
| Voice Conversion Augmentation for Speaker Recognition on Defective Datasets | Apr 1, 2024 | Speaker RecognitionVoice Conversion | —Unverified | 0 | 0 |
| Voice Morphing: Two Identities in One Voice | Sep 5, 2023 | MORPHSpeaker Recognition | —Unverified | 0 | 0 |
| Voice Quality and Pitch Features in Transformer-Based Speech Recognition | Dec 21, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices | Dec 20, 2023 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| VoxBlink: A Large Scale Speaker Verification Dataset on Camera | Aug 14, 2023 | Speaker RecognitionSpeaker Verification | —Unverified | 0 | 0 |
| VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge | Dec 5, 2019 | Speaker Recognition | —Unverified | 0 | 0 |
| VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge | Dec 12, 2020 | Speaker Recognition | —Unverified | 0 | 0 |
| VoxVietnam: a Large-Scale Multi-Genre Dataset for Vietnamese Speaker Recognition | Dec 31, 2024 | DiversitySpeaker Recognition | —Unverified | 0 | 0 |
| VoxWatch: An open-set speaker recognition benchmark on VoxCeleb | Jun 30, 2023 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 | 0 |
| Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes | Nov 29, 2023 | Face RecognitionFace Swapping | —Unverified | 0 | 0 |
| WeCanTalk: A New Multi-language, Multi-modal Resource for Speaker Recognition | Jun 1, 2022 | Speaker Recognition | —Unverified | 0 | 0 |
| We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings | Jul 5, 2024 | Speaker RecognitionSpeech Synthesis | —Unverified | 0 | 0 |
| What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis | Jul 1, 2021 | Decision MakingDialect Identification | —Unverified | 0 | 0 |
| Who is Authentic Speaker | Apr 30, 2024 | Speaker RecognitionVoice Conversion | —Unverified | 0 | 0 |
| Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? | Apr 27, 2022 | Self-Supervised LearningSpeaker Recognition | —Unverified | 0 | 0 |
| Xi-Vector Embedding for Speaker Recognition | Aug 12, 2021 | Speaker Recognition | —Unverified | 0 | 0 |
| XMUSPEECH System for VoxCeleb Speaker Recognition Challenge 2021 | Sep 6, 2021 | Speaker Recognition | —Unverified | 0 | 0 |
| x-vectors meet emotions: A study on dependencies between emotion and speaker recognition | Feb 12, 2020 | Emotion ClassificationEmotion Recognition | —Unverified | 0 | 0 |
| 3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization | Mar 29, 2024 | Self-Supervised Learningspeaker-diarization | —Unverified | 0 | 0 |
| 以三元組損失微調時延神經網路語者嵌入函數之語者辨識系統(Time Delay Neural Network-based Speaker Embedding Function Fine-tuned with Triplet Loss for Distance-based Speaker Recognition) | Oct 1, 2019 | Speaker RecognitionTriplet | —Unverified | 0 | 0 |
| A Benchmark for Understanding and Generating Dialogue between Characters in Stories | Sep 18, 2022 | Dialogue GenerationSpeaker Recognition | —Unverified | 0 | 0 |
| A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments | Jun 17, 2025 | DenoisingSpeaker Recognition | —Unverified | 0 | 0 |
| A Comparative Study of Pre-trained Speech and Audio Embeddings for Speech Emotion Recognition | Apr 22, 2023 | Emotion RecognitionSpeaker Recognition | —Unverified | 0 | 0 |
| A comparative study of several parameterizations for speaker recognition | Feb 24, 2022 | QuantizationSpeaker Recognition | —Unverified | 0 | 0 |
| A comparison of linear and non-linear calibrations for speaker recognition | Feb 11, 2014 | Speaker Recognition | —Unverified | 0 | 0 |
| A Deep Neural Network for Short-Segment Speaker Recognition | Jul 22, 2019 | Speaker Recognition | —Unverified | 0 | 0 |
| Adversarial defense for deep speaker recognition using hybrid adversarial training | Oct 30, 2020 | Adversarial DefenseSpeaker Recognition | —Unverified | 0 | 0 |
| Adversarial Speaker Verification | Apr 29, 2019 | General ClassificationSpeaker Recognition | —Unverified | 0 | 0 |
| A Generative Model for Score Normalization in Speaker Recognition | Sep 28, 2017 | Speaker Recognition | —Unverified | 0 | 0 |
| A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion | Jun 28, 2022 | Speaker RecognitionVoice Conversion | —Unverified | 0 | 0 |
| A Lightweight Speaker Recognition System Using Timbre Properties | Oct 12, 2020 | GPUSpeaker Identification | —Unverified | 0 | 0 |
| A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning | Aug 7, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |