SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 201250 of 435 papers

TitleStatusHype
The DKU-DukeECE-Lenovo System for the Diarization Task of the 2021 VoxCeleb Speaker Recognition Challenge0
SEC4SR: A Security Analysis Platform for Speaker RecognitionCode1
Curricular SincNet: Towards Robust Deep Speaker Recognition by Emphasizing Hard Samples in Latent SpaceCode0
NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition0
Xi-Vector Embedding for Speaker Recognition0
Improved Speech Emotion Recognition using Transfer Learning and Spectrogram Augmentation0
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument soundsCode0
Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems0
Dropout Regularization for Self-Supervised Learning of Transformer Encoder Speech Representation0
Pretext Tasks selection for multitask self-supervised speech representation learningCode0
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis0
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition0
Graph-based Label Propagation for Semi-Supervised Speaker Identification0
PF-Net: Personalized Filter for Speaker Recognition from Raw WaveformCode0
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework0
Improving Fairness in Speaker Recognition0
Exploring Deep Learning for Joint Audio-Visual Lip BiometricsCode1
Conditional independence for pretext task selection in Self-supervised speech representation learningCode0
Speaker embeddings by modeling channel-wise correlationsCode1
SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification SystemCode0
Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker RecognitionCode0
EfficientTDNN: Efficient Architecture Search for Speaker RecognitionCode1
Semi Supervised Learning For Few-shot Audio Classification By Episodic Triplet Mining0
Content-Aware Speaker Embeddings for Speaker Diarisation0
U-vectors: Generating clusterable speaker embedding from unlabeled dataCode0
Study of Pre-processing Defenses against Adversarial Attacks on State-of-the-art Speaker Recognition Systems0
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge0
DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech SynthesisCode0
Speaker Recognition Based on Deep Learning: An Overview0
Deep Discriminative Feature Learning for Accent RecognitionCode1
Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech0
An Empirical Study on Text-Independent Speaker Verification based on the GE2E Method0
COVID-19 Patient Detection from Telephone Quality Speech DataCode0
Masked Proxy Loss For Text-Independent Speaker VerificationCode0
Query Expansion System for the VoxCeleb Speaker Recognition Challenge 20200
ShaneRun System Description to VoxCeleb Speaker Recognition Challenge 20200
Speaker anonymisation using the McAdams coefficientCode1
The xx205 System for the VoxCeleb Speaker Recognition Challenge 20200
Adversarial defense for deep speaker recognition using hybrid adversarial training0
Deep Speaker Vector Normalization with Maximum Gaussianality TrainingCode0
Deep generative LDACode0
The UPC Speaker Verification System Submitted to VoxCeleb Speaker Recognition Challenge 2020 (VoxSRC-20)0
CopyPaste: An Augmentation Method for Speech Emotion Recognition0
Leveraging speaker attribute information using multi task learning for speaker verification and diarizationCode1
Unsupervised Learning of Disentangled Speech Content and Style Representation0
Momentum Contrast Speaker Representation Learning0
The HUAWEI Speaker Diarisation System for the VoxCeleb Speaker Diarisation Challenge0
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium LearningCode1
Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge20200
Three-Dimensional Lip Motion Network for Text-Independent Speaker RecognitionCode0
Show:102550
← PrevPage 5 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified