Speaker Recognition

Speaker Recognition is the process of identifying or confirming a person's identity from segments of their speech.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition
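
The definition above covers two related tasks: identifying who is speaking among known speakers, and confirming a claimed identity. A minimal sketch of the difference follows, assuming each utterance has already been mapped to a fixed-length embedding vector and that cosine similarity is used for scoring. The function names, the toy embeddings, and the 0.7 threshold are hypothetical and chosen only for illustration; they do not come from any paper listed here.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def identify(test_emb, enrolled):
    """Identification: pick the enrolled speaker most similar to the test embedding."""
    return max(enrolled, key=lambda name: cosine(test_emb, enrolled[name]))

def verify(test_emb, claimed_emb, threshold=0.7):
    """Verification: accept the claimed identity if similarity reaches the threshold.

    The threshold value is an assumption; in practice it is tuned on held-out trials.
    """
    return cosine(test_emb, claimed_emb) >= threshold

# Toy embeddings (hypothetical values, not real model output)
enrolled = {"alice": [0.9, 0.1, 0.0], "bob": [0.1, 0.8, 0.3]}
test = [0.85, 0.2, 0.05]

best_match = identify(test, enrolled)
accepted_as_alice = verify(test, enrolled["alice"])
accepted_as_bob = verify(test, enrolled["bob"])
```

Identification returns a label from a closed set, while verification returns a yes/no decision for one claimed speaker, which is why verification systems are evaluated with threshold-based error rates.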

Papers

Showing 76–100 of 435 papers

Title	Date	Tasks	Status	Score
Prosody-Driven Privacy-Preserving Dementia Detection	Jul 3, 2024	Attribute, Diagnostic	Code Available	5
COVID-19 Patient Detection from Telephone Quality Speech Data	Nov 9, 2020	Sentence, Speaker Recognition	Code Available	5
Pretext Tasks selection for multitask self-supervised speech representation learning	Jul 1, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	5
Personal VAD: Speaker-Conditioned Voice Activity Detection	Aug 12, 2019	Action Detection, Activity Detection	Code Available	5
Private kNN-VC: Interpretable Anonymization of Converted Speech	May 23, 2025	Speaker anonymization, Speaker Recognition	Code Available	5
Attention-Based Models for Text-Dependent Speaker Verification	Oct 28, 2017	Image Captioning, Machine Translation	Code Available	5
Risk of re-identification for shared clinical speech recordings	Oct 18, 2022	Speaker Recognition	Code Available	5
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders	Oct 25, 2019	General Classification, Representation Learning	Code Available	5
Conditional independence for pretext task selection in Self-supervised speech representation learning	Apr 15, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	5
Inconsistency Ranking-based Noisy Label Detection for High-quality Data	Dec 1, 2022	Metric Learning, Speaker Recognition	Code Available	5
Curricular SincNet: Towards Robust Deep Speaker Recognition by Emphasizing Hard Samples in Latent Space	Aug 21, 2021	Face Recognition, Speaker Recognition	Code Available	5
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile Health	Feb 8, 2023	Speaker Recognition	Code Available	5
CoLMbo: Speaker Language Model for Descriptive Profiling	Jun 11, 2025	Descriptive, Language Modeling	Code Available	5
Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks	Oct 1, 2019	Speaker Identification, Speaker Recognition	Code Available	5
Masked Proxy Loss For Text-Independent Speaker Verification	Nov 9, 2020	Metric Learning, Speaker Recognition	Code Available	5
CN-CELEB: a challenging Chinese speaker recognition dataset	Oct 31, 2019	Speaker Recognition	Code Available	5
Filterbank design for end-to-end speech separation	Oct 23, 2019	Speaker Recognition, Speech Separation	Code Available	5
Improving fairness in speaker verification via Group-adapted Fusion Network	Feb 23, 2022	Fairness, Speaker Recognition	Code Available	5
DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis	Dec 9, 2020	Speaker Recognition, Speech Synthesis	Code Available	5
Deep Speaker Vector Normalization with Maximum Gaussianality Training	Oct 30, 2020	Speaker Recognition	Code Available	5
Certification of Speaker Recognition Models to Additive Perturbations	Apr 29, 2024	Few-Shot Learning, Speaker Recognition	Code Available	5
Deep Speaker: an End-to-End Neural Speaker Embedding System	May 5, 2017	Clustering, Speaker Identification	Code Available	5
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker Recognition	Nov 15, 2022	All, Emotion Classification	Code Available	5
Centroid-based deep metric learning for speaker recognition	Feb 6, 2019	Few-Shot Image Classification, Few-Shot Learning	Unverified	0
CEC: A Noisy Label Detection Method for Speaker Recognition	Jun 19, 2024	Speaker Recognition, Speaker Verification	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	w2v2-aam	EER	1.88	—	Unverified
2	WavLM+ECAPA-TDNN	EER	0.39	—	Unverified
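
The benchmark table reports EER (equal error rate) without defining it. EER is the operating point at which the false acceptance rate and the false rejection rate of a verification system are equal; lower is better. The sketch below approximates it by scanning the observed scores as candidate thresholds. The function name and the toy scores are hypothetical, and the table does not state units, although EER is conventionally reported in percent.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Approximate EER from trial scores (higher score = more likely same speaker).

    Scans every observed score as a decision threshold and returns the mean of
    the false acceptance and false rejection rates at the threshold where the
    two rates are closest.
    """
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best_gap, eer = None, None
    for t in thresholds:
        # False acceptance: a different-speaker trial scores at or above the threshold
        far = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        # False rejection: a same-speaker trial scores below the threshold
        frr = sum(s < t for s in target_scores) / len(target_scores)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer

# Toy trial scores (hypothetical, not taken from any listed system)
targets = [0.9, 0.8, 0.7, 0.4]
nontargets = [0.6, 0.3, 0.2, 0.1]
eer = equal_error_rate(targets, nontargets)
print(f"EER = {100 * eer:.2f}%")
```

With only a handful of trials the threshold scan is coarse; evaluation toolkits typically interpolate between adjacent thresholds on much larger trial lists.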