
Speaker Identification

Papers


Showing 41–50 of 248 papers

Title	Date	Tasks	Status	Hype	Score
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations	Apr 1, 2021	Disentanglement, Representation Learning	Code Available	1	5
Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers for Robust Speaker Embeddings	Mar 13, 2025	Speaker Identification, speech-recognition	Code Available	1	5
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages	Aug 24, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam	Jan 23, 2020	Speaker Identification, Speech Extraction	Code Available	1	5
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification	Aug 22, 2023	Self-Supervised Learning, Speaker Identification	Code Available	0	5
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation	May 18, 2020	Self-Supervised Learning, Speaker Identification	Code Available	0	5
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario	Jan 7, 2021	Multi-Task Learning, Speaker Identification	Code Available	0	5
Cross-Lingual Speaker Identification Using Distant Supervision	Oct 11, 2022	Language Modeling, Language Modelling	Code Available	0	5
On Learning Associations of Faces and Voices	May 15, 2018	Speaker Identification	Code Available	0	5
Contrastive Learning of General-Purpose Audio Representations	Oct 21, 2020	CoLA, Contrastive Learning	Code Available	0	5


Page 5 of 25

Datasets: VoxCeleb1, EVI en-GB, EVI fr-FR, EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified