SOTAVerified

Speaker Identification

Papers

Showing 126150 of 248 papers

TitleStatusHype
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets0
FastAudio: A Learnable Audio Front-End for Spoof Speech DetectionCode1
Towards Making the Most of Dialogue Characteristics for Neural Chat TranslationCode0
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus0
A Real-time Speaker Diarization System Based on Spatial Spectrum0
Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems0
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus0
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition0
Graph-based Label Propagation for Semi-Supervised Speaker Identification0
Learning Audio-Visual DereverberationCode1
MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation UnderstandingCode1
Supervised Speech Representation Learning for Parkinson's Disease ClassificationCode1
PF-Net: Personalized Filter for Speaker Recognition from Raw WaveformCode0
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings0
End-to-End Speaker-Attributed ASR with Transformer0
Streaming Multi-talker Speech Recognition with Joint Speaker Identification0
A Survey on Paralinguistics in Tamil Speech Processing0
Speech Resynthesis from Discrete Disentangled Self-Supervised RepresentationsCode1
Voice Privacy with Smart Digital Assistants in Educational Settings0
Blind Speech Separation and Dereverberation using Neural BeamformingCode1
Triplet loss based embeddings for forensic speaker identification in Spanish0
A Modulation-Domain Loss for Neural-Network-based Real-time Speech EnhancementCode1
CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions0
Speaker attribution with voice profiles by graph-based semi-supervised learning0
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenarioCode0
Show:102550
← PrevPage 6 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified