Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 435 papers

Title	Date	Tasks	Status	Hype	Score
Bias in Automated Speaker Recognition	Jan 24, 2022	BIG-bench Machine LearningFace Recognition	CodeCode Available	1	5
Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition	Jun 7, 2022	Speaker Recognitionspeech-recognition	CodeCode Available	1	5
Universal Adversarial Perturbations Generative Network for Speaker Recognition	Apr 7, 2020	Speaker Recognition	CodeCode Available	1	5
NPLDA: A Deep Neural PLDA Model for Speaker Verification	Feb 10, 2020	Speaker RecognitionSpeaker Verification	CodeCode Available	1	5
VoxCeleb2: Deep Speaker Recognition	Jun 14, 2018	Speaker RecognitionSpeaker Verification	CodeCode Available	0	5
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?	May 23, 2023	Caller DetectionSelf-Supervised Learning	CodeCode Available	0	5
U-vectors: Generating clusterable speaker embedding from unlabeled data	Feb 7, 2021	Domain AdaptationSpeaker Recognition	CodeCode Available	0	5
Version Control of Speaker Recognition Systems	Jul 23, 2020	Speaker Recognition	CodeCode Available	0	5
Additive Margin SincNet for Speaker Recognition	Jan 28, 2019	Deep LearningSpeaker Recognition	CodeCode Available	0	5
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument sounds	Jul 24, 2021	Data AugmentationInstrument Recognition	CodeCode Available	0	5
Vocal Style Factorization for Effective Speaker Recognition in Affective Scenarios	May 13, 2023	Speaker Recognition	CodeCode Available	0	5
Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems	Nov 3, 2019	Adversarial AttackSpeaker Recognition	CodeCode Available	0	5
Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data Augmentation	May 16, 2020	blind source separationData Augmentation	CodeCode Available	0	5
An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments	Feb 26, 2020	Face RecognitionFew-Shot Learning	CodeCode Available	0	5
Unified Hypersphere Embedding for Speaker Recognition	Jul 22, 2018	Speaker RecognitionText-Independent Speaker Recognition	CodeCode Available	0	5
Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition	Oct 13, 2020	SentenceSpeaker Recognition	CodeCode Available	0	5
Excitement Surfeited Turns to Errors: Deep Learning Testing Framework Based on Excitable Neurons	Feb 12, 2022	image-classificationImage Classification	CodeCode Available	0	5
Baselines and Protocols for Household Speaker Recognition	Apr 30, 2022	Speaker Recognition	CodeCode Available	0	5
A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states	Jul 9, 2024	ArticlesClassification	CodeCode Available	0	5
Deep Normalization for Speaker Vectors	Apr 7, 2020	Speaker Recognition	CodeCode Available	0	5
To train or not to train adversarially: A study of bias mitigation strategies for speaker recognition	Mar 17, 2022	Face RecognitionFairness	CodeCode Available	0	5
Deep generative LDA	Oct 30, 2020	Dimensionality ReductionSpeaker Recognition	CodeCode Available	0	5
PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform	May 31, 2021	Speaker IdentificationSpeaker Recognition	CodeCode Available	0	5
SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition Systems	Sep 14, 2023	Feature EngineeringInference Attack	CodeCode Available	0	5
Robust speaker recognition using unsupervised adversarial invariance	Nov 3, 2019	speaker-diarizationSpeaker Diarization	CodeCode Available	0	5
Prosody-Driven Privacy-Preserving Dementia Detection	Jul 3, 2024	AttributeDiagnostic	CodeCode Available	0	5
COVID-19 Patient Detection from Telephone Quality Speech Data	Nov 9, 2020	SentenceSpeaker Recognition	CodeCode Available	0	5
Pretext Tasks selection for multitask self-supervised speech representation learning	Jul 1, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	0	5
Personal VAD: Speaker-Conditioned Voice Activity Detection	Aug 12, 2019	Action DetectionActivity Detection	CodeCode Available	0	5
Private kNN-VC: Interpretable Anonymization of Converted Speech	May 23, 2025	Speaker anonymizationSpeaker Recognition	CodeCode Available	0	5
Attention-Based Models for Text-Dependent Speaker Verification	Oct 28, 2017	Image CaptioningMachine Translation	CodeCode Available	0	5
Risk of re-identification for shared clinical speech recordings	Oct 18, 2022	Speaker Recognition	CodeCode Available	0	5
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders	Oct 25, 2019	General ClassificationRepresentation Learning	CodeCode Available	0	5
Conditional independence for pretext task selection in Self-supervised speech representation learning	Apr 15, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	0	5
Inconsistency Ranking-based Noisy Label Detection for High-quality Data	Dec 1, 2022	Metric LearningSpeaker Recognition	CodeCode Available	0	5
Curricular SincNet: Towards Robust Deep Speaker Recognition by Emphasizing Hard Samples in Latent Space	Aug 21, 2021	Face RecognitionSpeaker Recognition	CodeCode Available	0	5
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health	Feb 8, 2023	Speaker Recognition	CodeCode Available	0	5
CoLMbo: Speaker Language Model for Descriptive Profiling	Jun 11, 2025	DescriptiveLanguage Modeling	CodeCode Available	0	5
Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks	Oct 1, 2019	Speaker IdentificationSpeaker Recognition	CodeCode Available	0	5
Masked Proxy Loss For Text-Independent Speaker Verification	Nov 9, 2020	Metric LearningSpeaker Recognition	CodeCode Available	0	5
CN-CELEB: a challenging Chinese speaker recognition dataset	Oct 31, 2019	Speaker Recognition	CodeCode Available	0	5
Filterbank design for end-to-end speech separation	Oct 23, 2019	Speaker RecognitionSpeech Separation	CodeCode Available	0	5
Improving fairness in speaker verification via Group-adapted Fusion Network	Feb 23, 2022	FairnessSpeaker Recognition	CodeCode Available	0	5
DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis	Dec 9, 2020	Speaker RecognitionSpeech Synthesis	CodeCode Available	0	5
Deep Speaker Vector Normalization with Maximum Gaussianality Training	Oct 30, 2020	Speaker Recognition	CodeCode Available	0	5
Certification of Speaker Recognition Models to Additive Perturbations	Apr 29, 2024	Few-Shot LearningSpeaker Recognition	CodeCode Available	0	5
Deep Speaker: an End-to-End Neural Speaker Embedding System	May 5, 2017	ClusteringSpeaker Identification	CodeCode Available	0	5
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition	Nov 15, 2022	AllEmotion Classification	CodeCode Available	0	5
Centroid-based deep metric learning for speaker recognition	Feb 6, 2019	Few-Shot Image ClassificationFew-Shot Learning	—Unverified	0	0
CEC: A Noisy Label Detection Method for Speaker Recognition	Jun 19, 2024	Speaker RecognitionSpeaker Verification	—Unverified	0	0

Show:10 25 50

← PrevPage 2 of 9Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	w2v2-aam	EER	1.88	—	Unverified
2	WavLM+ECAPA-TDNN	EER	0.39	—	Unverified