SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 201225 of 435 papers

TitleStatusHype
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge0
SEC4SR: A Security Analysis Platform for Speaker RecognitionCode1
Curricular SincNet: Towards Robust Deep Speaker Recognition by Emphasizing Hard Samples in Latent SpaceCode0
NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition0
Xi-Vector Embedding for Speaker Recognition0
Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation0
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument soundsCode0
Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems0
Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation0
Pretext Tasks selection for multitask self-supervised speech representation learningCode0
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis0
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition0
Graph-based Label Propagation for Semi-Supervised Speaker Identification0
PF-Net: Personalized Filter for Speaker Recognition from Raw WaveformCode0
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework0
Improving Fairness in Speaker Recognition0
Exploring Deep Learning for Joint Audio-Visual Lip BiometricsCode1
Conditional independence for pretext task selection in Self-supervised speech representation learningCode0
Speaker embeddings by modeling channel-wise correlationsCode1
SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification System0
Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition0
EfficientTDNN: Efficient Architecture Search for Speaker RecognitionCode1
Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining0
Content-Aware Speaker Embeddings for Speaker Diarisation0
U-vectors: Generating clusterable speaker embedding from unlabeled dataCode0
Show:102550
← PrevPage 9 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified