| SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification | Sep 18, 2021 | Neural Architecture SearchSpeaker Recognition | CodeCode Available | 1 | 5 |
| Universal Adversarial Perturbations Generative Network for Speaker Recognition | Apr 7, 2020 | Speaker Recognition | CodeCode Available | 1 | 5 |
| Toroidal Probabilistic Spherical Discriminant Analysis | Oct 27, 2022 | FormSpeaker Recognition | CodeCode Available | 1 | 5 |
| BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition | Jun 30, 2019 | AvgRepresentation Learning | CodeCode Available | 1 | 5 |
| PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform | May 31, 2021 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 | 5 |
| SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition Systems | Sep 14, 2023 | Feature EngineeringInference Attack | CodeCode Available | 0 | 5 |
| Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers? | May 23, 2023 | Caller DetectionSelf-Supervised Learning | CodeCode Available | 0 | 5 |
| SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System | Apr 5, 2021 | Speaker RecognitionSpeaker Verification | CodeCode Available | 0 | 5 |
| Robust speaker recognition using unsupervised adversarial invariance | Nov 3, 2019 | speaker-diarizationSpeaker Diarization | CodeCode Available | 0 | 5 |
| Additive Margin SincNet for Speaker Recognition | Jan 28, 2019 | Deep LearningSpeaker Recognition | CodeCode Available | 0 | 5 |
| Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data Augmentation | May 16, 2020 | blind source separationData Augmentation | CodeCode Available | 0 | 5 |
| An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments | Feb 26, 2020 | Face RecognitionFew-Shot Learning | CodeCode Available | 0 | 5 |
| Prosody-Driven Privacy-Preserving Dementia Detection | Jul 3, 2024 | AttributeDiagnostic | CodeCode Available | 0 | 5 |
| Pretext Tasks selection for multitask self-supervised speech representation learning | Jul 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Baselines and Protocols for Household Speaker Recognition | Apr 30, 2022 | Speaker Recognition | CodeCode Available | 0 | 5 |
| Private kNN-VC: Interpretable Anonymization of Converted Speech | May 23, 2025 | Speaker anonymizationSpeaker Recognition | CodeCode Available | 0 | 5 |
| A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states | Jul 9, 2024 | ArticlesClassification | CodeCode Available | 0 | 5 |
| Personal VAD: Speaker-Conditioned Voice Activity Detection | Aug 12, 2019 | Action DetectionActivity Detection | CodeCode Available | 0 | 5 |
| Risk of re-identification for shared clinical speech recordings | Oct 18, 2022 | Speaker Recognition | CodeCode Available | 0 | 5 |
| Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders | Oct 25, 2019 | General ClassificationRepresentation Learning | CodeCode Available | 0 | 5 |
| Inconsistency Ranking-based Noisy Label Detection for High-quality Data | Dec 1, 2022 | Metric LearningSpeaker Recognition | CodeCode Available | 0 | 5 |
| Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health | Feb 8, 2023 | Speaker Recognition | CodeCode Available | 0 | 5 |
| Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition | Nov 15, 2022 | AllEmotion Classification | CodeCode Available | 0 | 5 |
| Attention-Based Models for Text-Dependent Speaker Verification | Oct 28, 2017 | Image CaptioningMachine Translation | CodeCode Available | 0 | 5 |
| Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks | Oct 1, 2019 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 | 5 |