SOTAVerified

Speaker Identification

Papers

Showing 101150 of 248 papers

TitleStatusHype
A Study of Few-Shot Audio Classification0
A Survey on Paralinguistics in Tamil Speech Processing0
A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR0
A user study to compare two conversational assistants designed for people with hearing impairments0
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification0
Can Musical Emotion Be Quantified With Neural Jitter Or Shimmer? A Novel EEG Based Study With Hindustani Classical Music0
CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions0
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models0
Comparison of Gender- and Speaker-adaptive Emotion Recognition0
Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification0
Computer-assisted Speaker Diarization: How to Evaluate Human Corrections0
Computing with Hypervectors for Efficient Speaker Identification0
Cosine similarity-based adversarial process0
Cross-Lingual Speaker Identification from Weak Local Evidence0
Curie: A method for protecting SVM Classifier from Poisoning Attack0
DASB -- Discrete Audio and Speech Benchmark0
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data0
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models0
Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks0
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods0
Efficiency-oriented approaches for self-supervised speech representation learning0
Emirati-Accented Speaker Identification in Stressful Talking Conditions0
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings0
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis0
End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification0
End-to-End Speaker-Attributed ASR with Transformer0
Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample0
Ensemble knowledge distillation of self-supervised speech models0
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks0
Story Comprehension for Predicting What Happens Next0
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations0
Streaming Multi-talker Speech Recognition with Joint Speaker Identification0
Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals0
Many-to-Many Voice Conversion with Out-of-Dataset Speaker Support0
Symmetric Saliency-based Adversarial Attack To Speaker Identification0
Test-Time Training for Speech0
Text-based Speaker Identification on Multiparty Dialogues Using Multi-document Convolutional Neural Networks0
Text Independent Speaker Identification System for Access Control0
The Deterministic plus Stochastic Model of the Residual Signal and its Applications0
The DIRHA simulated corpus0
The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices0
SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems0
The RATS Collection: Supporting HLT Research with Degraded Audio Data0
TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches0
Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications0
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR0
Triplet loss based embeddings for forensic speaker identification in Spanish0
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model0
Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction0
Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification0
Show:102550
← PrevPage 3 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified