SOTAVerified

Speaker Identification

Papers

Showing 151200 of 248 papers

TitleStatusHype
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings0
A Study of Few-Shot Audio Classification0
Deep Discriminative Feature Learning for Accent RecognitionCode1
How Far Are We from Robust Voice Conversion: A Survey0
FoolHD: Fooling speaker identification by Highly imperceptible adversarial DisturbancesCode1
Multi-Modal Emotion Detection with Transfer Learning0
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASRCode1
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model0
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakersCode0
Contrastive Learning of General-Purpose Audio RepresentationsCode0
A Lightweight Speaker Recognition System Using Timbre Properties0
Remarks on Optimal Scores for Speaker Recognition0
Sum-Product Networks for Robust Automatic Speaker IdentificationCode1
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker RecordingsCode1
SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems0
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers0
Integrated Replay Spoofing-aware Text-independent Speaker Verification0
audino: A Modern Annotation Tool for Audio and SpeechCode2
Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features0
Identify Speakers in Cocktail Parties with End-to-End AttentionCode0
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio RepresentationCode0
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification0
AutoSpeech: Neural Architecture Search for Speaker RecognitionCode1
Speaker Recognition in Bengali Language from Nonlinear Features0
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length PairsCode1
AM-MobileNet1D: A Portable Model for Speaker RecognitionCode1
End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification0
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data0
Speaker Identification using EEG0
Multi-Task Learning with Auxiliary Speaker Identification for Conversational Emotion Recognition0
Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition ModelsCode1
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention0
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeamCode1
Robust Speaker Recognition Using Speech Enhancement And Attention Model0
Supervised Speaker Embedding De-Mixing in Two-Speaker Environment0
The Deterministic plus Stochastic Model of the Residual Signal and its Applications0
Advances in Online Audio-Visual Meeting Transcription0
Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion?0
Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals0
Reducing audio membership inference attack accuracy to chance: 4 defenses0
Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors0
Delving into VoxCeleb: environment invariant speaker recognitionCode0
Generative Pre-Training for Speech with Autoregressive Predictive CodingCode1
Word-level Embeddings for Cross-Task Transfer Learning in Speech ProcessingCode0
H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model0
Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networksCode0
Emirati-Accented Speaker Identification in Stressful Talking Conditions0
Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model0
Cosine similarity-based adversarial process0
Large-Scale Speaker Diarization of Radio Broadcast Archives0
Show:102550
← PrevPage 4 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified