SOTAVerified|Agents Browse Leaderboard About Blog

Speaker Identification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11–20 of 248 papers

Title	Date	Tasks	Status	Hype	Score
AM-MobileNet1D: A Portable Model for Speaker Recognition	Mar 31, 2020	Deep Learningmodel	CodeCode Available	1	5
A Modulation-Domain Loss for Neural-Network-based Real-time Speech Enhancement	Feb 15, 2021	Speaker IdentificationSpeech Denoising	CodeCode Available	1	5
GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation Understanding	May 16, 2023	Speaker Identification	CodeCode Available	1	5
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam	Jan 23, 2020	Speaker IdentificationSpeech Extraction	CodeCode Available	1	5
FastAudio: A Learnable Audio Front-End for Spoof Speech Detection	Sep 6, 2021	Speaker IdentificationSpeaker Verification	CodeCode Available	1	5
Disentangling Textual and Acoustic Features of Neural Speech Representations	Oct 3, 2024	DisentanglementEmotion Recognition	CodeCode Available	1	5
End-to-End Chinese Speaker Identification	Jul 1, 2022	coreference-resolutionCoreference Resolution	CodeCode Available	1	5
ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event Classification	Nov 23, 2022	Keyword SpottingSelf-Supervised Learning	CodeCode Available	1	5
AutoSpeech: Neural Architecture Search for Speaker Recognition	May 7, 2020	image-classificationImage Classification	CodeCode Available	1	5
ATST: Audio Representation Learning with Teacher-Student Transformer	Apr 26, 2022	Audio ClassificationInstrument Recognition	CodeCode Available	1	5

Show:10 25 50

← PrevPage 2 of 25Next →

All datasets VoxCeleb1 EVI en-GB EVI fr-FR EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified