SOTAVerified

Speaker Identification

Papers

Showing 101125 of 248 papers

TitleStatusHype
基於聽覺感知模型之類神經網路及其在語者識別上之應用 (Two-stage Attentional Auditory Model Inspired Neural Network and Its Application to Speaker Identification) [In Chinese]0
A Preliminary Exploration with GPT-4o Voice Mode0
A Real-time Speaker Diarization System Based on Spatial Spectrum0
A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions0
A Study of Few-Shot Audio Classification0
A Survey on Paralinguistics in Tamil Speech Processing0
A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR0
A user study to compare two conversational assistants designed for people with hearing impairments0
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification0
Can Musical Emotion Be Quantified With Neural Jitter Or Shimmer? A Novel EEG Based Study With Hindustani Classical Music0
CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions0
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models0
Comparison of Gender- and Speaker-adaptive Emotion Recognition0
Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification0
Computer-assisted Speaker Diarization: How to Evaluate Human Corrections0
Computing with Hypervectors for Efficient Speaker Identification0
Cosine similarity-based adversarial process0
Cross-Lingual Speaker Identification from Weak Local Evidence0
Curie: A method for protecting SVM Classifier from Poisoning Attack0
DASB -- Discrete Audio and Speech Benchmark0
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data0
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models0
Delving into VoxCeleb: environment invariant speaker recognition0
Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks0
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods0
Show:102550
← PrevPage 5 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified