SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 2650 of 435 papers

TitleStatusHype
Toroidal Probabilistic Spherical Discriminant AnalysisCode1
Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel’s Weekly Video PodcastsCode1
Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video PodcastsCode1
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length PairsCode1
Neural PLDA Modeling for End-to-End Speaker VerificationCode1
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?Code1
Deep Discriminative Feature Learning for Accent RecognitionCode1
Adversarial Attack and Defense Strategies for Deep Speaker Recognition SystemsCode1
SEC4SR: A Security Analysis Platform for Speaker RecognitionCode1
OLKAVS: An Open Large-Scale Korean Audio-Visual Speech DatasetCode1
Probabilistic Back-ends for Online Speaker Recognition and ClusteringCode1
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language RecognitionCode1
Crossed-Time Delay Neural Network for Speaker RecognitionCode1
Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end modelCode1
Self-supervised Speaker Recognition with Loss-gated LearningCode1
Speaker embeddings by modeling channel-wise correlationsCode1
Speaker Recognition from Raw Waveform with SincNetCode1
Bias in Automated Speaker RecognitionCode1
EfficientTDNN: Efficient Architecture Search for Speaker RecognitionCode1
Exploring Deep Learning for Joint Audio-Visual Lip BiometricsCode1
AutoSpeech: Neural Architecture Search for Speaker RecognitionCode1
AM-MobileNet1D: A Portable Model for Speaker RecognitionCode1
HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRECode1
TAPLoss: A Temporal Acoustic Parameter Loss for Speech EnhancementCode1
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddingsCode1
Show:102550
← PrevPage 2 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified