SOTAVerified

Speaker Identification

Papers

Showing 126150 of 248 papers

TitleStatusHype
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and IdentificationCode0
Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention0
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment0
Listen only to me! How well can target speech extraction handle false alarms?0
Karaoker: Alignment-free singing voice synthesis with speech training data0
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification0
Improved Relation Networks for End-to-End Speaker Verification and Identification0
NeuraGen-A Low-Resource Neural Network based approach for Gender Classification0
Speaker Identification Experiments Under Gender De-Identification0
On the relevance of bandwidth extension for speaker identification0
openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer0
Speech watermarking: an approach for the forensic analysis of digital telephonic recordings0
Tubes Among Us: Analog Attack on Automatic Speaker Identification0
Cross-Lingual Speaker Identification from Weak Local Evidence0
The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices0
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification0
A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions0
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition0
PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation ExtractionCode0
Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker IdentificationCode0
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets0
Towards Making the Most of Dialogue Characteristics for Neural Chat TranslationCode0
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus0
A Real-time Speaker Diarization System Based on Spatial Spectrum0
Show:102550
← PrevPage 6 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified