SOTAVerified

Speaker Identification

Papers

Showing 101150 of 248 papers

TitleStatusHype
Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition0
Pretraining Multi-Speaker Identification for Neural Speaker Diarization0
Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion?0
Privacy-preserving Representation Learning for Speech Understanding0
Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples0
Probing Self-supervised Learning Models with Target Speech Extraction0
Progressive Residual Extraction based Pre-training for Speech Representation Learning0
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus0
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus0
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation0
Quantized Approximate Signal Processing (QASP): Towards Homomorphic Encryption for audio0
Read, Look or Listen? What's Needed for Solving a Multimodal Dataset0
Reducing audio membership inference attack accuracy to chance: 4 defenses0
Remarks on Optimal Scores for Speaker Recognition0
Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling0
Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems0
REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion0
Rhythm Features for Speaker Identification0
Robust Speaker Recognition Using Speech Enhancement And Attention Model0
SCDiar: a streaming diarization system based on speaker change detection and speech recognition0
Security and Privacy Problems in Voice Assistant Applications: A Survey0
Seeing Voices and Hearing Faces: Cross-modal biometric matching0
Significance of Chirp MFCC as a Feature in Speech and Audio Applications0
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information0
基於稀疏表示之語者識別 (Sparse Representation Based Speaker Identification) [In Chinese]0
Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features0
Speaker attribution with voice profiles by graph-based semi-supervised learning0
Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones0
Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues0
Speaker Identification Experiments Under Gender De-Identification0
Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG0
Speaker identification from the sound of the human breath0
Speaker Identification From Youtube Obtained Data0
Speaker Identification in each of the Neutral and Shouted Talking Environments based on Gender-Dependent Approach Using SPHMMs0
Speaker Identification using EEG0
Speaker Identification using Speech Recognition0
Speaker Recognition in Bengali Language from Nonlinear Features0
Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition0
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention0
Speech-FT: Merging Pre-trained And Fine-Tuned Speech Representation Models For Cross-Task Generalization0
Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis0
Speech Unlearning0
Speech watermarking: an approach for the forensic analysis of digital telephonic recordings0
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks0
Story Comprehension for Predicting What Happens Next0
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations0
Streaming Multi-talker Speech Recognition with Joint Speaker Identification0
Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals0
Many-to-Many Voice Conversion with Out-of-Dataset Speaker Support0
Show:102550
← PrevPage 3 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified