SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 125 of 435 papers

TitleStatusHype
PaddleSpeech: An Easy-to-Use All-in-One Speech ToolkitCode6
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification BenchmarkCode5
Take the aTrain. Introducing an Interface for the Accessible Transcription of InterviewsCode3
Pushing the limits of raw waveform speaker recognitionCode3
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf modelsCode3
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker RecognitionCode3
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram MaskingCode2
SEED: Speaker Embedding Enhancement Diffusion ModelCode2
Reshape Dimensions Network for Speaker RecognitionCode2
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddingsCode1
NPLDA: A Deep Neural PLDA Model for Speaker VerificationCode1
Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel’s Weekly Video PodcastsCode1
Neural PLDA Modeling for End-to-End Speaker VerificationCode1
Probabilistic Back-ends for Online Speaker Recognition and ClusteringCode1
OLKAVS: An Open Large-Scale Korean Audio-Visual Speech DatasetCode1
Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end modelCode1
Fine-tuning wav2vec2 for speaker recognitionCode1
Leveraging speaker attribute information using multi task learning for speaker verification and diarizationCode1
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language RecognitionCode1
Exploring Deep Learning for Joint Audio-Visual Lip BiometricsCode1
Adversarial Attack and Defense Strategies for Deep Speaker Recognition SystemsCode1
HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRECode1
Crossed-Time Delay Neural Network for Speaker RecognitionCode1
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length PairsCode1
AutoSpeech: Neural Architecture Search for Speaker RecognitionCode1
Show:102550
← PrevPage 1 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified