Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 435 papers

Title	Date	Tasks	Status	Hype	Score
Bias in Automated Speaker Recognition	Jan 24, 2022	BIG-bench Machine LearningFace Recognition	CodeCode Available	1	5
Fine-tuning wav2vec2 for speaker recognition	Sep 30, 2021	ClassificationSpeaker Recognition	CodeCode Available	1	5
EfficientTDNN: Efficient Architecture Search for Speaker Recognition	Mar 25, 2021	Data AugmentationNetwork Pruning	CodeCode Available	1	5
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech	Jul 12, 2020	Keyword SpottingSelf-Supervised Learning	CodeCode Available	1	5
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?	May 23, 2023	Caller DetectionSelf-Supervised Learning	CodeCode Available	0	5
SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition Systems	Sep 14, 2023	Feature EngineeringInference Attack	CodeCode Available	0	5
Additive Margin SincNet for Speaker Recognition	Jan 28, 2019	Deep LearningSpeaker Recognition	CodeCode Available	0	5
PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform	May 31, 2021	Speaker IdentificationSpeaker Recognition	CodeCode Available	0	5
Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data Augmentation	May 16, 2020	blind source separationData Augmentation	CodeCode Available	0	5
An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments	Feb 26, 2020	Face RecognitionFew-Shot Learning	CodeCode Available	0	5
Robust speaker recognition using unsupervised adversarial invariance	Nov 3, 2019	speaker-diarizationSpeaker Diarization	CodeCode Available	0	5
Risk of re-identification for shared clinical speech recordings	Oct 18, 2022	Speaker Recognition	CodeCode Available	0	5
Baselines and Protocols for Household Speaker Recognition	Apr 30, 2022	Speaker Recognition	CodeCode Available	0	5
Prosody-Driven Privacy-Preserving Dementia Detection	Jul 3, 2024	AttributeDiagnostic	CodeCode Available	0	5
A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states	Jul 9, 2024	ArticlesClassification	CodeCode Available	0	5
Private kNN-VC: Interpretable Anonymization of Converted Speech	May 23, 2025	Speaker anonymizationSpeaker Recognition	CodeCode Available	0	5
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders	Oct 25, 2019	General ClassificationRepresentation Learning	CodeCode Available	0	5
Inconsistency Ranking-based Noisy Label Detection for High-quality Data	Dec 1, 2022	Metric LearningSpeaker Recognition	CodeCode Available	0	5
Masked Proxy Loss For Text-Independent Speaker Verification	Nov 9, 2020	Metric LearningSpeaker Recognition	CodeCode Available	0	5
Attention-Based Models for Text-Dependent Speaker Verification	Oct 28, 2017	Image CaptioningMachine Translation	CodeCode Available	0	5
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health	Feb 8, 2023	Speaker Recognition	CodeCode Available	0	5
Personal VAD: Speaker-Conditioned Voice Activity Detection	Aug 12, 2019	Action DetectionActivity Detection	CodeCode Available	0	5
Improving fairness in speaker verification via Group-adapted Fusion Network	Feb 23, 2022	FairnessSpeaker Recognition	CodeCode Available	0	5
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition	Nov 15, 2022	AllEmotion Classification	CodeCode Available	0	5
Filterbank design for end-to-end speech separation	Oct 23, 2019	Speaker RecognitionSpeech Separation	CodeCode Available	0	5

Show:10 25 50

← PrevPage 3 of 18Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	w2v2-aam	EER	1.88	—	Unverified
2	WavLM+ECAPA-TDNN	EER	0.39	—	Unverified