SOTAVerified

Speaker Identification

Papers

Showing 151200 of 248 papers

TitleStatusHype
VAST: A Corpus of Video Annotation for Speech Technologies0
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution0
Voice Privacy with Smart Digital Assistants in Educational Settings0
Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices0
VoxWatch: An open-set speaker recognition benchmark on VoxCeleb0
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment0
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification0
Weakly Supervised Training of Speaker Identification Models0
Matics Software Suite: New Tools for Evaluation and Data Exploration0
MKPLS: Manifold Kernel Partial Least Squares for Lipreading and Speaker Identification0
Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models0
Multi-Label Training for Text-Independent Speaker Identification0
Multi-Modal Emotion Detection with Transfer Learning0
Multi-Task Learning with Auxiliary Speaker Identification for Conversational Emotion Recognition0
NeuraGen-A Low-Resource Neural Network based approach for Gender Classification0
Hearing-Loss Compensation Using Deep Neural Networks: A Framework and Results From a Listening Test0
Neural Predictive Coding using Convolutional Neural Networks towards Unsupervised Learning of Speaker Characteristics0
On the relevance of bandwidth extension for speaker identification0
On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications0
On the Use of Different Feature Extraction Methods for Linear and Non Linear kernels0
openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer0
Ordered and Binary Speaker Embedding0
Tubes Among Us: Analog Attack on Automatic Speaker Identification0
PolInterviews -- A Dataset of German Politician Public Broadcast Interviews0
Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition0
Pretraining Multi-Speaker Identification for Neural Speaker Diarization0
Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion?0
Privacy-preserving Representation Learning for Speech Understanding0
Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples0
Probing Self-supervised Learning Models with Target Speech Extraction0
Progressive Residual Extraction based Pre-training for Speech Representation Learning0
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus0
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus0
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation0
Quantized Approximate Signal Processing (QASP): Towards Homomorphic Encryption for audio0
Read, Look or Listen? What's Needed for Solving a Multimodal Dataset0
Reducing audio membership inference attack accuracy to chance: 4 defenses0
Remarks on Optimal Scores for Speaker Recognition0
Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling0
Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems0
REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion0
Rhythm Features for Speaker Identification0
Robust Speaker Recognition Using Speech Enhancement And Attention Model0
SCDiar: a streaming diarization system based on speaker change detection and speech recognition0
Security and Privacy Problems in Voice Assistant Applications: A Survey0
Seeing Voices and Hearing Faces: Cross-modal biometric matching0
Significance of Chirp MFCC as a Feature in Speech and Audio Applications0
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information0
基於稀疏表示之語者識別 (Sparse Representation Based Speaker Identification) [In Chinese]0
Show:102550
← PrevPage 4 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified