SOTAVerified

Speaker Identification

Papers

Showing 151–175 of 248 papers

| Title | Status | Hype |
|---|---|---|
| Streaming Multi-talker Speech Recognition with Joint Speaker Identification | | 0 |
| Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals | | 0 |
| Many-to-Many Voice Conversion with Out-of-Dataset Speaker Support | | 0 |
| Symmetric Saliency-based Adversarial Attack To Speaker Identification | | 0 |
| Test-Time Training for Speech | | 0 |
| Text-based Speaker Identification on Multiparty Dialogues Using Multi-document Convolutional Neural Networks | | 0 |
| Text Independent Speaker Identification System for Access Control | | 0 |
| The Deterministic plus Stochastic Model of the Residual Signal and its Applications | | 0 |
| The DIRHA simulated corpus | | 0 |
| The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices | | 0 |
| SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems | | 0 |
| The RATS Collection: Supporting HLT Research with Degraded Audio Data | | 0 |
| TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches | | 0 |
| Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications | | 0 |
| Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR | | 0 |
| Triplet loss based embeddings for forensic speaker identification in Spanish | | 0 |
| T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model | | 0 |
| Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction | | 0 |
| Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification | | 0 |
| VAST: A Corpus of Video Annotation for Speech Technologies | | 0 |
| VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution | | 0 |
| Voice Privacy with Smart Digital Assistants in Educational Settings | | 0 |
| Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices | | 0 |
| VoxWatch: An open-set speaker recognition benchmark on VoxCeleb | | 0 |
| WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment | | 0 |

Benchmark Results

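| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MSM-MAE | Top-1 (%) | 96.6 | | Unverified |
| 2 | M2D/0.6 | Top-1 (%) | 96.5 | | Unverified |
| 3 | M2D/0.7 | Top-1 (%) | 96.3 | | Unverified |
| 4 | M2D ratio=0.6 | Top-1 (%) | 94.8 | | Unverified |
| 5 | AudioMAE (local) | Top-1 (%) | 94.8 | | Unverified |
| 6 | ATST Base (ours) | Top-1 (%) | 94.3 | | Unverified |
| 7 | AudioMAE (global) | Top-1 (%) | 94.1 | | Unverified |
| 8 | AutoSpeech (N=8,C=128) | Top-1 (%) | 87.66 | | Unverified |
| 9 | SSAST-FRAME | Top-1 (%) | 80.8 | | Unverified |
| 10 | SSAMBA | Top-1 (%) | 70.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fuzzy Retrieval | Top-1 (%) | 67.77 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fuzzy Retrieval | Top-1 (%) | 80.83 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fuzzy Retrieval | Top-1 (%) | 95.13 | | Unverified |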