SOTAVerified

Speaker Identification

Papers

Showing 126150 of 248 papers

TitleStatusHype
Efficiency-oriented approaches for self-supervised speech representation learning0
Emirati-Accented Speaker Identification in Stressful Talking Conditions0
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings0
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis0
End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification0
End-to-End Speaker-Attributed ASR with Transformer0
Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample0
Streaming Multi-talker Speech Recognition with Joint Speaker Identification0
Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals0
Many-to-Many Voice Conversion with Out-of-Dataset Speaker Support0
Symmetric Saliency-based Adversarial Attack To Speaker Identification0
Test-Time Training for Speech0
Text-based Speaker Identification on Multiparty Dialogues Using Multi-document Convolutional Neural Networks0
Text Independent Speaker Identification System for Access Control0
The Deterministic plus Stochastic Model of the Residual Signal and its Applications0
The DIRHA simulated corpus0
The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices0
SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems0
The RATS Collection: Supporting HLT Research with Degraded Audio Data0
TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches0
Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications0
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR0
Triplet loss based embeddings for forensic speaker identification in Spanish0
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model0
Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction0
Show:102550
← PrevPage 6 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified