| Private kNN-VC: Interpretable Anonymization of Converted Speech | May 23, 2025 | Speaker anonymizationSpeaker Recognition | CodeCode Available | 0 | 5 |
| COVID-19 Patient Detection from Telephone Quality Speech Data | Nov 9, 2020 | SentenceSpeaker Recognition | CodeCode Available | 0 | 5 |
| Pretext Tasks selection for multitask self-supervised speech representation learning | Jul 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Attention-Based Models for Text-Dependent Speaker Verification | Oct 28, 2017 | Image CaptioningMachine Translation | CodeCode Available | 0 | 5 |
| Inconsistency Ranking-based Noisy Label Detection for High-quality Data | Dec 1, 2022 | Metric LearningSpeaker Recognition | CodeCode Available | 0 | 5 |
| Personal VAD: Speaker-Conditioned Voice Activity Detection | Aug 12, 2019 | Action DetectionActivity Detection | CodeCode Available | 0 | 5 |
| Prosody-Driven Privacy-Preserving Dementia Detection | Jul 3, 2024 | AttributeDiagnostic | CodeCode Available | 0 | 5 |
| Masked Proxy Loss For Text-Independent Speaker Verification | Nov 9, 2020 | Metric LearningSpeaker Recognition | CodeCode Available | 0 | 5 |
| Conditional independence for pretext task selection in Self-supervised speech representation learning | Apr 15, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health | Feb 8, 2023 | Speaker Recognition | CodeCode Available | 0 | 5 |
| Curricular SincNet: Towards Robust Deep Speaker Recognition by Emphasizing Hard Samples in Latent Space | Aug 21, 2021 | Face RecognitionSpeaker Recognition | CodeCode Available | 0 | 5 |
| Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition | Nov 15, 2022 | AllEmotion Classification | CodeCode Available | 0 | 5 |
| Improving fairness in speaker verification via Group-adapted Fusion Network | Feb 23, 2022 | FairnessSpeaker Recognition | CodeCode Available | 0 | 5 |
| Robust speaker recognition using unsupervised adversarial invariance | Nov 3, 2019 | speaker-diarizationSpeaker Diarization | CodeCode Available | 0 | 5 |
| Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks | Oct 1, 2019 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 | 5 |
| CoLMbo: Speaker Language Model for Descriptive Profiling | Jun 11, 2025 | DescriptiveLanguage Modeling | CodeCode Available | 0 | 5 |
| Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders | Oct 25, 2019 | General ClassificationRepresentation Learning | CodeCode Available | 0 | 5 |
| CN-CELEB: a challenging Chinese speaker recognition dataset | Oct 31, 2019 | Speaker Recognition | CodeCode Available | 0 | 5 |
| Filterbank design for end-to-end speech separation | Oct 23, 2019 | Speaker RecognitionSpeech Separation | CodeCode Available | 0 | 5 |
| 3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization | Mar 29, 2024 | Self-Supervised Learningspeaker-diarization | CodeCode Available | 0 | 5 |
| DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis | Dec 9, 2020 | Speaker RecognitionSpeech Synthesis | CodeCode Available | 0 | 5 |
| Deep Speaker Vector Normalization with Maximum Gaussianality Training | Oct 30, 2020 | Speaker Recognition | CodeCode Available | 0 | 5 |
| Delving into VoxCeleb: environment invariant speaker recognition | Oct 24, 2019 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 | 5 |
| Certification of Speaker Recognition Models to Additive Perturbations | Apr 29, 2024 | Few-Shot LearningSpeaker Recognition | CodeCode Available | 0 | 5 |
| Deep Speaker: an End-to-End Neural Speaker Embedding System | May 5, 2017 | ClusteringSpeaker Identification | CodeCode Available | 0 | 5 |