Speaker Identification

Papers

Showing 111–120 of 248 papers

Title	Date	Tasks	Status	Hype
Exploring VQ-VAE with Prosody Parameters for Speaker Anonymization	Sep 24, 2024	Decoder, Speaker anonymization	Unverified	0
基於聽覺感知模型之類神經網路及其在語者識別上之應用 (Two-stage Attentional Auditory Model Inspired Neural Network and Its Application to Speaker Identification) [In Chinese]	Nov 1, 2017	Speaker Identification	Unverified	0
Experiments on Open-Set Speaker Identification with Discriminatively Trained Neural Networks	Apr 2, 2019	Speaker Identification	Unverified	0
Invited Talk: IBM Cognitive Computing - An NLP Renaissance!	Oct 1, 2014	Machine Translation, Question Answering	Unverified	0
ExARN: self-attending RNN for target speaker extraction	Dec 2, 2022	Speaker Identification, Target Speaker Extraction	Unverified	0
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers	Jun 19, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Karaoker: Alignment-free singing voice synthesis with speech training data	Apr 8, 2022	Singing Voice Synthesis, Speaker Identification	Unverified	0
Large-Scale Speaker Diarization of Radio Broadcast Archives	Jun 19, 2019	speaker-diarization, Speaker Diarization	Unverified	0
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models	Jan 24, 2025	Emotion Classification, Speaker Identification	Unverified	0
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement	Mar 3, 2024	Automatic Speech Recognition, Keyword Spotting	Unverified	0

Datasets: VoxCeleb1, EVI en-GB, EVI fr-FR, EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified