SOTAVerified

Speaker Identification

Papers

Showing 2650 of 248 papers

TitleStatusHype
Learning Audio-Visual DereverberationCode1
MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation UnderstandingCode1
Supervised Speech Representation Learning for Parkinson's Disease ClassificationCode1
Speech Resynthesis from Discrete Disentangled Self-Supervised RepresentationsCode1
Blind Speech Separation and Dereverberation using Neural BeamformingCode1
A Modulation-Domain Loss for Neural-Network-based Real-time Speech EnhancementCode1
Deep Discriminative Feature Learning for Accent RecognitionCode1
FoolHD: Fooling speaker identification by Highly imperceptible adversarial DisturbancesCode1
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASRCode1
Sum-Product Networks for Robust Automatic Speaker IdentificationCode1
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker RecordingsCode1
AutoSpeech: Neural Architecture Search for Speaker RecognitionCode1
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length PairsCode1
AM-MobileNet1D: A Portable Model for Speaker RecognitionCode1
Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition ModelsCode1
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeamCode1
Generative Pre-Training for Speech with Autoregressive Predictive CodingCode1
Learning Speaker Representations with Mutual InformationCode1
Speaker Recognition from Raw Waveform with SincNetCode1
CoLMbo: Speaker Language Model for Descriptive ProfilingCode0
Rhythm Features for Speaker Identification0
French Listening Tests for the Assessment of Intelligibility, Quality, and Identity of Body-Conducted Speech Enhancement0
Speech Unlearning0
Pretraining Multi-Speaker Identification for Neural Speaker Diarization0
REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion0
Show:102550
← PrevPage 2 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified