SOTAVerified

Speaker Identification

Papers

Showing 4150 of 248 papers

TitleStatusHype
Speech Resynthesis from Discrete Disentangled Self-Supervised RepresentationsCode1
Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers for Robust Speaker EmbeddingsCode1
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languagesCode1
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeamCode1
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker IdentificationCode0
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio RepresentationCode0
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenarioCode0
Cross-Lingual Speaker Identification Using Distant SupervisionCode0
On Learning Associations of Faces and VoicesCode0
Contrastive Learning of General-Purpose Audio RepresentationsCode0
Show:102550
← PrevPage 5 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified