Speaker Verification

Speaker verification is the task of verifying a person's claimed identity from characteristics of their voice.

( Image credit: Contrastive-Predictive-Coding-PyTorch )
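As a rough illustration of the verification decision described above, the sketch below compares two fixed-length voice embeddings by cosine similarity and accepts the identity claim when the score clears a threshold. The embedding values, the `verify` helper, and the threshold of 0.5 are all hypothetical; real systems obtain embeddings from a trained speaker encoder and tune the threshold on held-out trials.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length embedding vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimension")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("embeddings must be non-zero")
    return dot / (norm_a * norm_b)


def verify(enrolled: Sequence[float], test: Sequence[float],
           threshold: float = 0.5) -> bool:
    """Accept the identity claim if the similarity reaches the threshold.

    The default threshold is an arbitrary placeholder, not a recommended value.
    """
    return cosine_similarity(enrolled, test) >= threshold


# Toy embeddings (made up for illustration only).
enrolled = [0.9, 0.1, 0.3]
same_speaker = [0.8, 0.2, 0.35]
other_speaker = [-0.2, 0.9, -0.4]

accept_same = verify(enrolled, same_speaker)    # similar direction
accept_other = verify(enrolled, other_speaker)  # dissimilar direction
```

Cosine scoring is only one possible back-end; it is used here because it needs no training beyond the embeddings themselves.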

Papers


Showing 1–50 of 746 papers

Title	Date	Tasks	Status	Hype	Score
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit	May 20, 2022	All, Automatic Speech Recognition (ASR)	Code Available	6	5
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification Benchmark	Jul 16, 2024	Diversity, Speaker Identification	Code Available	5	5
Pushing the limits of raw waveform speaker recognition	Mar 16, 2022	Self-Supervised Learning, Speaker Recognition	Code Available	3	5
Ludwig: a type-based declarative deep learning toolbox	Sep 17, 2019	Decoder, Deep Learning	Code Available	3	5
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models	Jan 30, 2024	Self-Supervised Learning, Speaker Recognition	Code Available	3	5
SALMONN: Towards Generic Hearing Abilities for Large Language Models	Oct 20, 2023	Audio captioning, Automatic Speech Recognition	Code Available	3	5
Magnitude-aware Probabilistic Speaker Embeddings	Feb 28, 2022	Out-of-Distribution Detection, Speaker Verification	Code Available	3	5
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification	Dec 6, 2023	All, Speaker Verification	Code Available	3	5
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality	Jul 14, 2022	Speaker Verification, speech-recognition	Code Available	2	5
Towards A Unified Conformer Structure: from ASR to ASV Task	Nov 14, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	2	5
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT	May 15, 2022	Representation Learning, Speaker Verification	Code Available	2	5
Singer Identity Representation Learning using Self-Supervised Techniques	Jan 10, 2024	Domain Generalization, Representation Learning	Code Available	2	5
Cross-modal information fusion for voice spoofing detection	Feb 1, 2023	Automatic Speech Recognition, fake voice detection	Code Available	1	5
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing	Mar 18, 2022	Representation Learning, Speaker Verification	Code Available	1	5
Generalized End-to-End Loss for Speaker Verification	Oct 28, 2017	Domain Adaptation, Speaker Verification	Code Available	1	5
FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With Attention	Oct 27, 2020	Disentanglement, Speaker Verification	Code Available	1	5
Extended U-Net for Speaker Verification in Noisy Environments	Jun 27, 2022	Denoising, Speaker Identification	Code Available	1	5
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint	May 10, 2020	Speaker Verification, Speech Synthesis	Code Available	1	5
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw Waveforms	Apr 1, 2020	Speaker Verification, Text-Independent Speaker Verification	Code Available	1	5
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentation	Feb 24, 2022	Audio Deepfake Detection, Data Augmentation	Code Available	1	5
End-to-end anti-spoofing with RawNet2	Nov 2, 2020	Audio Deepfake Detection, Speaker Verification	Code Available	1	5
Evaluation of Speech Representations for MOS prediction	Jun 16, 2023	Prediction, Self-Supervised Learning	Code Available	1	5
CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations	Sep 1, 2021	Emotion Classification, Language Modeling	Code Available	1	5
ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification	Jan 10, 2025	Speaker Verification	Code Available	1	5
FastAudio: A Learnable Audio Front-End for Spoof Speech Detection	Sep 6, 2021	Speaker Identification, Speaker Verification	Code Available	1	5
FilterAugment: An Acoustic Environmental Data Augmentation Method	Oct 7, 2021	Data Augmentation, Event Detection	Code Available	1	5
Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack Detection	Jun 25, 2020	Binary Classification, Speaker Verification	Code Available	1	5
CryCeleb: A Speaker Verification Dataset Based on Infant Cry Sounds	May 1, 2023	Speaker Verification	Code Available	1	5
Crossed-Time Delay Neural Network for Speaker Recognition	May 31, 2020	Speaker Recognition, Speaker Verification	Code Available	1	5
Cross-modal Audio-visual Co-learning for Text-independent Speaker Verification	Feb 22, 2023	Speaker Verification, Text-Independent Speaker Verification	Code Available	1	5
Backdoor Attack against Speaker Verification	Oct 22, 2020	Backdoor Attack, Clustering	Code Available	1	5
Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof Detection	Sep 5, 2021	Speaker Verification, Spoof Detection	Code Available	1	5
Explainable deepfake and spoofing detection: an attack analysis using SHapley Additive exPlanations	Feb 28, 2022	Face Swapping, Speaker Verification	Code Available	1	5
Disentanglement in a GAN for Unconditional Speech Synthesis	Jul 4, 2023	Disentanglement, Generative Adversarial Network	Code Available	1	5
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning	Feb 6, 2020	Reinforcement Learning, Speaker Verification	Code Available	1	5
An Unsupervised Autoregressive Model for Speech Representation Learning	Apr 5, 2019	General Classification, model	Code Available	1	5
Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models	Sep 14, 2023	Speaker Verification, Speech Enhancement	Code Available	1	5
A Fully Tensorized Recurrent Neural Network	Oct 8, 2020	image-classification, Image Classification	Code Available	1	5
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection	Apr 14, 2019	Speaker Verification	Code Available	1	5
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances	Apr 4, 2021	Speaker Verification	Code Available	1	5
Attack on practical speaker verification system using universal adversarial perturbations	May 19, 2021	Real-World Adversarial Attack, Room Impulse Response (RIR)	Code Available	1	5
ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation Plan	Sep 1, 2021	Face Swapping, Speaker Verification	Code Available	1	5
Audio Spoofing Verification using Deep Convolutional Neural Networks by Transfer Learning	Aug 8, 2020	Speaker Verification, Transfer Learning	Code Available	1	5
AutoSpeech: Neural Architecture Search for Speaker Recognition	May 7, 2020	image-classification, Image Classification	Code Available	1	5
DropClass and DropAdapt: Dropping classes for deep speaker representation learning	Feb 2, 2020	General Classification, Representation Learning	Code Available	1	5
End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake Detection	Jul 27, 2021	Audio Deepfake Detection, DeepFake Detection	Code Available	1	5
Bts-e: Audio deepfake detection using breathing-talking-silence encoder	May 5, 2023	Audio Deepfake Detection, DeepFake Detection	Code Available	1	5
Cross-Age Speaker Verification: Learning Age-Invariant Speaker Embeddings	Jul 13, 2022	Age Estimation, Speaker Verification	Code Available	1	5
Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks	Jul 19, 2021	Speaker Verification	Code Available	1	5
An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems	Apr 3, 2021	Data Augmentation, Multi-Task Learning	Code Available	1	5


All datasets: VoxCeleb, VoxCeleb1, CALLHOME, CN-CELEB, ASVspoof 2019 - LA, VibraVox (forehead accelerometer), VibraVox (headset microphone), VibraVox (rigid in-ear microphone), VibraVox (soft in-ear microphone), VibraVox (temple vibration pickup), VibraVox (throat microphone), VoxCeleb2

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Multi Task SSL	EER	1.98	—	Unverified
2	ReDimNet-B0-LM (1.0M)	EER	1.16	—	Unverified
3	TitanNet-S	EER	1.15	—	Unverified
4	ReDimNet-B0-LM-ASNorm (1.0M)	EER	1.07	—	Unverified
5	SpeechNAS	EER	1.02	—	Unverified
6	ReDimNet-B1-LM (2.2M)	EER	0.85	—	Unverified
7	TitanNet-M	EER	0.81	—	Unverified
8	ReDimNet-B1-LM-ASNorm (2.2M)	EER	0.73	—	Unverified
9	TitanNet-L	EER	0.68	—	Unverified
10	ReDimNet-B2-SF2-LM (4.7M)	EER	0.57	—	Unverified
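The EER metric in these benchmark tables is the equal error rate: the operating point at which the false acceptance rate (impostor trials accepted) equals the false rejection rate (genuine trials rejected). It is often quoted as a percentage. The sketch below approximates it with a plain threshold sweep over toy scores. The function name and the score lists are hypothetical, and evaluation toolkits usually interpolate along the ROC curve rather than picking the closest discrete threshold, so treat this as an approximation rather than a reference implementation.

```python
from typing import Sequence


def equal_error_rate(target_scores: Sequence[float],
                     nontarget_scores: Sequence[float]) -> float:
    """Approximate EER (as a fraction) by sweeping every observed score
    as a decision threshold and keeping the point where the false
    acceptance and false rejection rates are closest."""
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best_gap = float("inf")
    best_eer = 1.0
    for thr in thresholds:
        # A trial is accepted when its score is >= the threshold.
        far = sum(s >= thr for s in nontarget_scores) / len(nontarget_scores)
        frr = sum(s < thr for s in target_scores) / len(target_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap = gap
            best_eer = (far + frr) / 2.0
    return best_eer


# Toy similarity scores (made up): higher means "more likely same speaker".
genuine = [0.9, 0.8, 0.7, 0.4]
impostor = [0.6, 0.3, 0.2, 0.1]

eer = equal_error_rate(genuine, impostor)
eer_percent = 100.0 * eer
```

In this toy set one genuine trial scores below one impostor trial, so no threshold separates the two groups perfectly and the sweep settles where both error rates are one in four.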

#	Model	Metric	Claimed	Verified	Status
1	Fine-tuned HuBERT Large	EER	2.36	—	Unverified
2	ReDimNet-B0-LM (1.0M)	EER	1.16	—	Unverified
3	ReDimNet-B0-LM-ASNorm (1.0M)	EER	1.07	—	Unverified
4	SpeechNAS	EER	1.02	—	Unverified
5	ReDimNet-B1-LM (2.2M)	EER	0.85	—	Unverified
6	ReDimNet-B1-LM-ASNorm (2.2M)	EER	0.73	—	Unverified
7	ReDimNet-B2-SF2-LM (4.7M)	EER	0.57	—	Unverified
8	ReDimNet-B2-SF2-LM-ASNorm (4.7M)	EER	0.52	—	Unverified
9	ReDimNet-B4-LM (6.3M)	EER	0.51	—	Unverified
10	ReDimNet-B3-LM (3.0M)	EER	0.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GE2E	Cosine EER	3.55	—	Unverified
2		Cosine EER	2.38	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet with Attention Backend	EER	10.77	—	Unverified
2	X-Vectors with Attention Backend	EER	10.12	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ECAPA-TDNN	minDCF	0	—	Unverified
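The minDCF metric reported here is the minimum normalized detection cost: a weighted sum of miss and false-alarm rates, minimized over all decision thresholds and divided by the cost of the better trivial system (accept everything or reject everything). The sketch below follows that general definition. The prior `p_target=0.01` and the unit costs are assumptions chosen for illustration, since the page does not say which settings the benchmark used; the function name and toy scores are hypothetical as well.

```python
from typing import Sequence


def min_dcf(target_scores: Sequence[float],
            nontarget_scores: Sequence[float],
            p_target: float = 0.01,   # assumed prior of a genuine trial
            c_miss: float = 1.0,      # assumed cost of rejecting a genuine trial
            c_fa: float = 1.0) -> float:  # assumed cost of accepting an impostor
    """Minimum normalized detection cost over a sweep of thresholds."""
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    thresholds.append(float("inf"))  # operating point that rejects every trial
    best = float("inf")
    for thr in thresholds:
        p_miss = sum(s < thr for s in target_scores) / len(target_scores)
        p_fa = sum(s >= thr for s in nontarget_scores) / len(nontarget_scores)
        dcf = c_miss * p_miss * p_target + c_fa * p_fa * (1.0 - p_target)
        best = min(best, dcf)
    # Normalize by the cost of the cheaper trivial decision rule.
    default_cost = min(c_miss * p_target, c_fa * (1.0 - p_target))
    return best / default_cost


# Toy scores (made up): higher means "more likely same speaker".
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.6, 0.3, 0.2, 0.1]

cost = min_dcf(genuine_scores, impostor_scores)
```

With a small target prior, false alarms dominate the cost, so the minimum tends to sit at a stricter threshold than the equal-error point. A value of 0 corresponds to scores that some threshold separates perfectly.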

#	Model	Metric	Claimed	Verified	Status
1	ECAPA2	Test EER	0.01	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ECAPA2	Test EER	0	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ECAPA2	Test EER	0.03	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ECAPA2	Test EER	0.02	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ECAPA2	Test EER	0.08	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ECAPA2	Test EER	0.04	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ResNet-50	EER	100	—	Unverified