SOTAVerified

Speaker Identification

Papers

Showing 151-200 of 248 papers

| Title | Status | Hype |
|---|---|---|
| Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems | | 0 |
| QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus | | 0 |
| Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition | | 0 |
| Graph-based Label Propagation for Semi-Supervised Speaker Identification | | 0 |
| PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform | Code | 0 |
| End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings | | 0 |
| Streaming Multi-talker Speech Recognition with Joint Speaker Identification | | 0 |
| End-to-End Speaker-Attributed ASR with Transformer | | 0 |
| A Survey on Paralinguistics in Tamil Speech Processing | | 0 |
| Voice Privacy with Smart Digital Assistants in Educational Settings | | 0 |
| Triplet loss based embeddings for forensic speaker identification in Spanish | | 0 |
| CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions | | 0 |
| Speaker attribution with voice profiles by graph-based semi-supervised learning | | 0 |
| Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario | Code | 0 |
| Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings | | 0 |
| A Study of Few-Shot Audio Classification | | 0 |
| How Far Are We from Robust Voice Conversion: A Survey | | 0 |
| Multi-Modal Emotion Detection with Transfer Learning | | 0 |
| T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model | | 0 |
| Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers | Code | 0 |
| Contrastive Learning of General-Purpose Audio Representations | Code | 0 |
| A Lightweight Speaker Recognition System Using Timbre Properties | | 0 |
| Remarks on Optimal Scores for Speaker Recognition | | 0 |
| SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems | | 0 |
| Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers | | 0 |
| Integrated Replay Spoofing-aware Text-independent Speaker Verification | | 0 |
| Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features | | 0 |
| Identify Speakers in Cocktail Parties with End-to-End Attention | Code | 0 |
| Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation | Code | 0 |
| Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification | | 0 |
| Speaker Recognition in Bengali Language from Nonlinear Features | | 0 |
| End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification | | 0 |
| Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data | | 0 |
| Speaker Identification using EEG | | 0 |
| Multi-Task Learning with Auxiliary Speaker Identification for Conversational Emotion Recognition | | 0 |
| Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention | | 0 |
| Supervised Speaker Embedding De-Mixing in Two-Speaker Environment | | 0 |
| Robust Speaker Recognition Using Speech Enhancement And Attention Model | | 0 |
| The Deterministic plus Stochastic Model of the Residual Signal and its Applications | | 0 |
| Advances in Online Audio-Visual Meeting Transcription | | 0 |
| Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion? | | 0 |
| Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals | | 0 |
| Reducing audio membership inference attack accuracy to chance: 4 defenses | | 0 |
| Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors | | 0 |
| Delving into VoxCeleb: environment invariant speaker recognition | Code | 0 |
| Word-level Embeddings for Cross-Task Transfer Learning in Speech Processing | Code | 0 |
| H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model | | 0 |
| Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks | Code | 0 |
| Emirati-Accented Speaker Identification in Stressful Talking Conditions | | 0 |
| Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model | | 0 |

Benchmark Results

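| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MSM-MAE | Top-1 (%) | 96.6 | | Unverified |
| 2 | M2D/0.6 | Top-1 (%) | 96.5 | | Unverified |
| 3 | M2D/0.7 | Top-1 (%) | 96.3 | | Unverified |
| 4 | M2D ratio=0.6 | Top-1 (%) | 94.8 | | Unverified |
| 5 | AudioMAE (local) | Top-1 (%) | 94.8 | | Unverified |
| 6 | ATST Base (ours) | Top-1 (%) | 94.3 | | Unverified |
| 7 | AudioMAE (global) | Top-1 (%) | 94.1 | | Unverified |
| 8 | AutoSpeech (N=8,C=128) | Top-1 (%) | 87.66 | | Unverified |
| 9 | SSAST-FRAME | Top-1 (%) | 80.8 | | Unverified |
| 10 | SSAMBA | Top-1 (%) | 70.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fuzzy Retrieval | Top-1 (%) | 67.77 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fuzzy Retrieval | Top-1 (%) | 80.83 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Fuzzy Retrieval | Top-1 (%) | 95.13 | | Unverified |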