SOTAVerified|Agents Browse Leaderboard About

Speaker Identification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 91–100 of 248 papers

Title	Date	Tasks	Status	Hype
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre	Jun 29, 2022	DisentanglementSpeaker Identification	—Unverified	0
Extended U-Net for Speaker Verification in Noisy Environments	Jun 27, 2022	DenoisingSpeaker Identification	CodeCode Available	1
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems	Jun 18, 2022	Speaker IdentificationSpeaker Verification	—Unverified	0
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations	Jun 16, 2022	Speaker IdentificationSpeech Extraction	—Unverified	0
Speaker Identification using Speech Recognition	May 29, 2022	Speaker Identificationspeech-recognition	—Unverified	0
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit	May 20, 2022	AllAutomatic Speech Recognition (ASR)	CodeCode Available	6
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information	May 8, 2022	Self-Supervised LearningSpeaker Identification	—Unverified	0
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution	May 6, 2022	BenchmarkingSpeaker Identification	—Unverified	0
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification	Apr 28, 2022	Speaker IdentificationSpeaker Verification	CodeCode Available	0
ATST: Audio Representation Learning with Teacher-Student Transformer	Apr 26, 2022	Audio ClassificationInstrument Recognition	CodeCode Available	1

Show:10 25 50

← PrevPage 10 of 25Next →

All datasets VoxCeleb1 EVI en-GB EVI fr-FR EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified