SOTAVerified

Speaker Identification

Papers

Showing 201–225 of 248 papers

| Title | Status | Hype |
|---|---|---|
| Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models | | 0 |
| Comparison of Gender- and Speaker-adaptive Emotion Recognition | | 0 |
| Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification | | 0 |
| Computer-assisted Speaker Diarization: How to Evaluate Human Corrections | | 0 |
| Computing with Hypervectors for Efficient Speaker Identification | | 0 |
| Cosine similarity-based adversarial process | | 0 |
| Cross-Lingual Speaker Identification from Weak Local Evidence | | 0 |
| Curie: A method for protecting SVM Classifier from Poisoning Attack | | 0 |
| DASB -- Discrete Audio and Speech Benchmark | | 0 |
| Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data | | 0 |
| Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models | | 0 |
| Delving into VoxCeleb: environment invariant speaker recognition | | 0 |
| Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks | | 0 |
| Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods | | 0 |
| Efficiency-oriented approaches for self-supervised speech representation learning | | 0 |
| Emirati-Accented Speaker Identification in Stressful Talking Conditions | | 0 |
| End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings | | 0 |
| End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis | | 0 |
| End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification | | 0 |
| End-to-End Speaker-Attributed ASR with Transformer | | 0 |
| Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample | | 0 |
| Ensemble knowledge distillation of self-supervised speech models | | 0 |
| Evaluating Speaker Identity Coding in Self-supervised Models and Humans | | 0 |
| Evaluation of Automatic Formant Trackers | | 0 |
| ExARN: self-attending RNN for target speaker extraction | | 0 |

Benchmark Results

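| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MSM-MAE | Top-1 (%) | 96.6 | | Unverified |
| 2 | M2D/0.6 | Top-1 (%) | 96.5 | | Unverified |
| 3 | M2D/0.7 | Top-1 (%) | 96.3 | | Unverified |
| 4 | M2D ratio=0.6 | Top-1 (%) | 94.8 | | Unverified |
| 5 | AudioMAE (local) | Top-1 (%) | 94.8 | | Unverified |
| 6 | ATST Base (ours) | Top-1 (%) | 94.3 | | Unverified |
| 7 | AudioMAE (global) | Top-1 (%) | 94.1 | | Unverified |
| 8 | AutoSpeech (N=8,C=128) | Top-1 (%) | 87.66 | | Unverified |
| 9 | SSAST-FRAME | Top-1 (%) | 80.8 | | Unverified |
| 10 | SSAMBA | Top-1 (%) | 70.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fuzzy Retrieval | Top-1 (%) | 67.77 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fuzzy Retrieval | Top-1 (%) | 80.83 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fuzzy Retrieval | Top-1 (%) | 95.13 | | Unverified |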