| Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition | Nov 15, 2022 | AllEmotion Classification | CodeCode Available | 0 |
| Robust speaker recognition using unsupervised adversarial invariance | Nov 3, 2019 | speaker-diarizationSpeaker Diarization | CodeCode Available | 0 |
| An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments | Feb 26, 2020 | Face RecognitionFew-Shot Learning | CodeCode Available | 0 |
| Pretext Tasks selection for multitask self-supervised speech representation learning | Jul 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data Augmentation | May 16, 2020 | blind source separationData Augmentation | CodeCode Available | 0 |
| Personal VAD: Speaker-Conditioned Voice Activity Detection | Aug 12, 2019 | Action DetectionActivity Detection | CodeCode Available | 0 |
| Inconsistency Ranking-based Noisy Label Detection for High-quality Data | Dec 1, 2022 | Metric LearningSpeaker Recognition | CodeCode Available | 0 |
| Conditional independence for pretext task selection in Self-supervised speech representation learning | Apr 15, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Improving fairness in speaker verification via Group-adapted Fusion Network | Feb 23, 2022 | FairnessSpeaker Recognition | CodeCode Available | 0 |
| CoLMbo: Speaker Language Model for Descriptive Profiling | Jun 11, 2025 | DescriptiveLanguage Modeling | CodeCode Available | 0 |
| Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems | Nov 3, 2019 | Adversarial AttackSpeaker Recognition | CodeCode Available | 0 |
| Filterbank design for end-to-end speech separation | Oct 23, 2019 | Speaker RecognitionSpeech Separation | CodeCode Available | 0 |
| DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis | Dec 9, 2020 | Speaker RecognitionSpeech Synthesis | CodeCode Available | 0 |
| Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders | Oct 25, 2019 | General ClassificationRepresentation Learning | CodeCode Available | 0 |
| Deep Speaker Vector Normalization with Maximum Gaussianality Training | Oct 30, 2020 | Speaker Recognition | CodeCode Available | 0 |
| Deep Speaker: an End-to-End Neural Speaker Embedding System | May 5, 2017 | ClusteringSpeaker Identification | CodeCode Available | 0 |
| U-vectors: Generating clusterable speaker embedding from unlabeled data | Feb 7, 2021 | Domain AdaptationSpeaker Recognition | CodeCode Available | 0 |
| A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states | Jul 9, 2024 | ArticlesClassification | CodeCode Available | 0 |
| Unified Hypersphere Embedding for Speaker Recognition | Jul 22, 2018 | Speaker RecognitionText-Independent Speaker Recognition | CodeCode Available | 0 |
| VoxCeleb2: Deep Speaker Recognition | Jun 14, 2018 | Speaker RecognitionSpeaker Verification | CodeCode Available | 0 |
| CN-CELEB: a challenging Chinese speaker recognition dataset | Oct 31, 2019 | Speaker Recognition | CodeCode Available | 0 |
| Version Control of Speaker Recognition Systems | Jul 23, 2020 | Speaker Recognition | CodeCode Available | 0 |
| SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition Systems | Sep 14, 2023 | Feature EngineeringInference Attack | CodeCode Available | 0 |
| Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition | Oct 13, 2020 | SentenceSpeaker Recognition | CodeCode Available | 0 |
| Attention-Based Models for Text-Dependent Speaker Verification | Oct 28, 2017 | Image CaptioningMachine Translation | CodeCode Available | 0 |