| Bias in Automated Speaker Recognition | Jan 24, 2022 | BIG-bench Machine Learning, Face Recognition | Code Available | 1 | 5 |
| TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech | Jul 12, 2020 | Keyword Spotting, Self-Supervised Learning | Code Available | 1 | 5 |
| Toroidal Probabilistic Spherical Discriminant Analysis | Oct 27, 2022 | Form, Speaker Recognition | Code Available | 1 | 5 |
| Crossed-Time Delay Neural Network for Speaker Recognition | May 31, 2020 | Speaker Recognition, Speaker Verification | Code Available | 1 | 5 |
| Version Control of Speaker Recognition Systems | Jul 23, 2020 | Speaker Recognition | Code Available | 0 | 5 |
| Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers? | May 23, 2023 | Caller Detection, Self-Supervised Learning | Code Available | 0 | 5 |
| U-vectors: Generating clusterable speaker embedding from unlabeled data | Feb 7, 2021 | Domain Adaptation, Speaker Recognition | Code Available | 0 | 5 |
| Additive Margin SincNet for Speaker Recognition | Jan 28, 2019 | Deep Learning, Speaker Recognition | Code Available | 0 | 5 |
| Unified Hypersphere Embedding for Speaker Recognition | Jul 22, 2018 | Speaker Recognition, Text-Independent Speaker Recognition | Code Available | 0 | 5 |
| Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds | Jul 24, 2021 | Data Augmentation, Instrument Recognition | Code Available | 0 | 5 |
| VoxCeleb2: Deep Speaker Recognition | Jun 14, 2018 | Speaker Recognition, Speaker Verification | Code Available | 0 | 5 |
| Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios | May 13, 2023 | Speaker Recognition | Code Available | 0 | 5 |
| Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data Augmentation | May 16, 2020 | blind source separation, Data Augmentation | Code Available | 0 | 5 |
| An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments | Feb 26, 2020 | Face Recognition, Few-Shot Learning | Code Available | 0 | 5 |
| Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition | Oct 13, 2020 | Sentence, Speaker Recognition | Code Available | 0 | 5 |
| To train or not to train adversarially: A study of bias mitigation strategies for speaker recognition | Mar 17, 2022 | Face Recognition, Fairness | Code Available | 0 | 5 |
| Excitement Surfeited Turns to Errors: Deep Learning Testing Framework Based on Excitable Neurons | Feb 12, 2022 | image-classification, Image Classification | Code Available | 0 | 5 |
| Baselines and Protocols for Household Speaker Recognition | Apr 30, 2022 | Speaker Recognition | Code Available | 0 | 5 |
| A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states | Jul 9, 2024 | Articles, Classification | Code Available | 0 | 5 |
| Deep Normalization for Speaker Vectors | Apr 7, 2020 | Speaker Recognition | Code Available | 0 | 5 |
| SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition Systems | Sep 14, 2023 | Feature Engineering, Inference Attack | Code Available | 0 | 5 |
| Deep generative LDA | Oct 30, 2020 | Dimensionality Reduction, Speaker Recognition | Code Available | 0 | 5 |
| SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System | Apr 5, 2021 | Speaker Recognition, Speaker Verification | Code Available | 0 | 5 |
| Risk of re-identification for shared clinical speech recordings | Oct 18, 2022 | Speaker Recognition | Code Available | 0 | 5 |
| PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform | May 31, 2021 | Speaker Identification, Speaker Recognition | Code Available | 0 | 5 |
| Private kNN-VC: Interpretable Anonymization of Converted Speech | May 23, 2025 | Speaker anonymization, Speaker Recognition | Code Available | 0 | 5 |
| COVID-19 Patient Detection from Telephone Quality Speech Data | Nov 9, 2020 | Sentence, Speaker Recognition | Code Available | 0 | 5 |
| Pretext Tasks selection for multitask self-supervised speech representation learning | Jul 1, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Attention-Based Models for Text-Dependent Speaker Verification | Oct 28, 2017 | Image Captioning, Machine Translation | Code Available | 0 | 5 |
| Inconsistency Ranking-based Noisy Label Detection for High-quality Data | Dec 1, 2022 | Metric Learning, Speaker Recognition | Code Available | 0 | 5 |
| Personal VAD: Speaker-Conditioned Voice Activity Detection | Aug 12, 2019 | Action Detection, Activity Detection | Code Available | 0 | 5 |
| Prosody-Driven Privacy-Preserving Dementia Detection | Jul 3, 2024 | Attribute, Diagnostic | Code Available | 0 | 5 |
| Masked Proxy Loss For Text-Independent Speaker Verification | Nov 9, 2020 | Metric Learning, Speaker Recognition | Code Available | 0 | 5 |
| Conditional independence for pretext task selection in Self-supervised speech representation learning | Apr 15, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health | Feb 8, 2023 | Speaker Recognition | Code Available | 0 | 5 |
| Curricular SincNet: Towards Robust Deep Speaker Recognition by Emphasizing Hard Samples in Latent Space | Aug 21, 2021 | Face Recognition, Speaker Recognition | Code Available | 0 | 5 |
| Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition | Nov 15, 2022 | All, Emotion Classification | Code Available | 0 | 5 |
| Improving fairness in speaker verification via Group-adapted Fusion Network | Feb 23, 2022 | Fairness, Speaker Recognition | Code Available | 0 | 5 |
| Robust speaker recognition using unsupervised adversarial invariance | Nov 3, 2019 | speaker-diarization, Speaker Diarization | Code Available | 0 | 5 |
| Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks | Oct 1, 2019 | Speaker Identification, Speaker Recognition | Code Available | 0 | 5 |
| CoLMbo: Speaker Language Model for Descriptive Profiling | Jun 11, 2025 | Descriptive, Language Modeling | Code Available | 0 | 5 |
| Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders | Oct 25, 2019 | General Classification, Representation Learning | Code Available | 0 | 5 |
| CN-CELEB: a challenging Chinese speaker recognition dataset | Oct 31, 2019 | Speaker Recognition | Code Available | 0 | 5 |
| Filterbank design for end-to-end speech separation | Oct 23, 2019 | Speaker Recognition, Speech Separation | Code Available | 0 | 5 |
| 3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization | Mar 29, 2024 | Self-Supervised Learning, speaker-diarization | Code Available | 0 | 5 |
| DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis | Dec 9, 2020 | Speaker Recognition, Speech Synthesis | Code Available | 0 | 5 |
| Deep Speaker Vector Normalization with Maximum Gaussianality Training | Oct 30, 2020 | Speaker Recognition | Code Available | 0 | 5 |
| Delving into VoxCeleb: environment invariant speaker recognition | Oct 24, 2019 | Speaker Identification, Speaker Recognition | Code Available | 0 | 5 |
| Certification of Speaker Recognition Models to Additive Perturbations | Apr 29, 2024 | Few-Shot Learning, Speaker Recognition | Code Available | 0 | 5 |
| Deep Speaker: an End-to-End Neural Speaker Embedding System | May 5, 2017 | Clustering, Speaker Identification | Code Available | 0 | 5 |