| The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge | Sep 5, 2021 | Action DetectionActivity Detection | —Unverified | 0 |
| SEC4SR: A Security Analysis Platform for Speaker Recognition | Sep 4, 2021 | Speaker Recognition | CodeCode Available | 1 |
| Curricular SincNet: Towards Robust Deep Speaker Recognition by Emphasizing Hard Samples in Latent Space | Aug 21, 2021 | Face RecognitionSpeaker Recognition | CodeCode Available | 0 |
| NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition | Aug 16, 2021 | Speaker Recognition | —Unverified | 0 |
| Xi-Vector Embedding for Speaker Recognition | Aug 12, 2021 | Speaker Recognition | —Unverified | 0 |
| Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation | Aug 5, 2021 | Emotion ClassificationEmotion Recognition | —Unverified | 0 |
| Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds | Jul 24, 2021 | Data AugmentationInstrument Recognition | CodeCode Available | 0 |
| Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems | Jul 9, 2021 | Representation LearningSpeaker Identification | —Unverified | 0 |
| Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation | Jul 9, 2021 | ClassificationSelf-Supervised Learning | —Unverified | 0 |
| Pretext Tasks selection for multitask self-supervised speech representation learning | Jul 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis | Jul 1, 2021 | Decision MakingDialect Identification | —Unverified | 0 |
| Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition | Jun 18, 2021 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 |
| Graph-based Label Propagation for Semi-Supervised Speaker Identification | Jun 15, 2021 | Speaker IdentificationSpeaker Recognition | —Unverified | 0 |
| PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform | May 31, 2021 | Speaker IdentificationSpeaker Recognition | CodeCode Available | 0 |
| Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework | May 25, 2021 | Speaker Recognition | —Unverified | 0 |
| Improving Fairness in Speaker Recognition | Apr 29, 2021 | AttributeFairness | —Unverified | 0 |
| Exploring Deep Learning for Joint Audio-Visual Lip Biometrics | Apr 17, 2021 | Deep LearningSpeaker Recognition | CodeCode Available | 1 |
| Conditional independence for pretext task selection in Self-supervised speech representation learning | Apr 15, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Speaker embeddings by modeling channel-wise correlations | Apr 6, 2021 | Speaker RecognitionStyle Transfer | CodeCode Available | 1 |
| SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System | Apr 5, 2021 | Speaker RecognitionSpeaker Verification | CodeCode Available | 0 |
| Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition | Apr 5, 2021 | Speaker Recognition | CodeCode Available | 0 |
| EfficientTDNN: Efficient Architecture Search for Speaker Recognition | Mar 25, 2021 | Data AugmentationNetwork Pruning | CodeCode Available | 1 |
| Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining | Feb 16, 2021 | Audio ClassificationEvent Detection | —Unverified | 0 |
| Content-Aware Speaker Embeddings for Speaker Diarisation | Feb 12, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| U-vectors: Generating clusterable speaker embedding from unlabeled data | Feb 7, 2021 | Domain AdaptationSpeaker Recognition | CodeCode Available | 0 |
| Study of Pre-processing Defenses against Adversarial Attacks on State-of-the-art Speaker Recognition Systems | Jan 22, 2021 | Speaker Recognition | —Unverified | 0 |
| VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge | Dec 12, 2020 | Speaker Recognition | —Unverified | 0 |
| DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis | Dec 9, 2020 | Speaker RecognitionSpeech Synthesis | CodeCode Available | 0 |
| Speaker Recognition Based on Deep Learning: An Overview | Dec 2, 2020 | Deep LearningDomain Adaptation | —Unverified | 0 |
| Deep Discriminative Feature Learning for Accent Recognition | Nov 25, 2020 | Face RecognitionSpeaker Identification | CodeCode Available | 1 |
| Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech | Nov 24, 2020 | Data AugmentationSpeaker Recognition | —Unverified | 0 |
| An Empirical Study on Text-Independent Speaker Verification based on the GE2E Method | Nov 10, 2020 | Speaker RecognitionSpeaker Verification | —Unverified | 0 |
| COVID-19 Patient Detection from Telephone Quality Speech Data | Nov 9, 2020 | SentenceSpeaker Recognition | CodeCode Available | 0 |
| Masked Proxy Loss For Text-Independent Speaker Verification | Nov 9, 2020 | Metric LearningSpeaker Recognition | CodeCode Available | 0 |
| Query Expansion System for the VoxCeleb Speaker Recognition Challenge 2020 | Nov 4, 2020 | Speaker RecognitionSpeaker Verification | —Unverified | 0 |
| ShaneRun System Description to VoxCeleb Speaker Recognition Challenge 2020 | Nov 3, 2020 | Speaker Recognition | —Unverified | 0 |
| Speaker anonymisation using the McAdams coefficient | Nov 2, 2020 | Speaker Recognition | CodeCode Available | 1 |
| The xx205 System for the VoxCeleb Speaker Recognition Challenge 2020 | Oct 31, 2020 | Speaker Recognition | —Unverified | 0 |
| Adversarial defense for deep speaker recognition using hybrid adversarial training | Oct 30, 2020 | Adversarial DefenseSpeaker Recognition | —Unverified | 0 |
| Deep Speaker Vector Normalization with Maximum Gaussianality Training | Oct 30, 2020 | Speaker Recognition | CodeCode Available | 0 |
| Deep generative LDA | Oct 30, 2020 | Dimensionality ReductionSpeaker Recognition | CodeCode Available | 0 |
| The UPC Speaker Verification System Submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20) | Oct 27, 2020 | Binary ClassificationSpeaker Recognition | —Unverified | 0 |
| CopyPaste: An Augmentation Method for Speech Emotion Recognition | Oct 27, 2020 | Data AugmentationEmotion Recognition | —Unverified | 0 |
| Leveraging speaker attribute information using multi task learning for speaker verification and diarization | Oct 27, 2020 | AttributeMulti-Task Learning | CodeCode Available | 1 |
| Unsupervised Learning of Disentangled Speech Content and Style Representation | Oct 24, 2020 | DecoderSpeaker Recognition | —Unverified | 0 |
| Momentum Contrast Speaker Representation Learning | Oct 22, 2020 | Contrastive LearningMetric Learning | —Unverified | 0 |
| The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge | Oct 22, 2020 | Action DetectionActivity Detection | —Unverified | 0 |
| Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium Learning | Oct 22, 2020 | Representation LearningSpeaker Recognition | CodeCode Available | 1 |
| Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge2020 | Oct 20, 2020 | Data AugmentationDenoising | —Unverified | 0 |
| Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition | Oct 13, 2020 | SentenceSpeaker Recognition | CodeCode Available | 0 |