SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 150 of 435 papers

TitleStatusHype
PaddleSpeech: An Easy-to-Use All-in-One Speech ToolkitCode6
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification BenchmarkCode5
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker RecognitionCode3
Pushing the limits of raw waveform speaker recognitionCode3
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf modelsCode3
Take the aTrain. Introducing an Interface for the Accessible Transcription of InterviewsCode3
Reshape Dimensions Network for Speaker RecognitionCode2
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram MaskingCode2
SEED: Speaker Embedding Enhancement Diffusion ModelCode2
USEF-TSE: Universal Speaker Embedding Free Target Speaker ExtractionCode1
Towards Understanding and Mitigating Audio Adversarial Examples for Speaker RecognitionCode1
TERA: Self-Supervised Learning of Transformer Encoder Representation for SpeechCode1
Universal Adversarial Perturbations Generative Network for Speaker RecognitionCode1
Unsupervised Representation Learning for Speaker Recognition via Contrastive Equilibrium LearningCode1
Training speaker recognition systems with limited dataCode1
Utterance-level Aggregation For Speaker Recognition In The WildCode1
Speech and Speaker Recognition from Raw Waveform with SincNetCode1
Speaker Recognition in the WildCode1
Speaker anonymisation using the McAdams coefficientCode1
Speaker recognition with two-step multi-modal deep cleansingCode1
Fine-tuning wav2vec2 for speaker recognitionCode1
Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition ModelsCode1
NPLDA: A Deep Neural PLDA Model for Speaker VerificationCode1
Temporal Dynamic Convolutional Neural Network for Text-Independent Speaker Verification and Phonemetic AnalysisCode1
Leveraging speaker attribute information using multi task learning for speaker verification and diarizationCode1
Toroidal Probabilistic Spherical Discriminant AnalysisCode1
Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel’s Weekly Video PodcastsCode1
Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video PodcastsCode1
Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length PairsCode1
Neural PLDA Modeling for End-to-End Speaker VerificationCode1
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?Code1
Deep Discriminative Feature Learning for Accent RecognitionCode1
Adversarial Attack and Defense Strategies for Deep Speaker Recognition SystemsCode1
SEC4SR: A Security Analysis Platform for Speaker RecognitionCode1
OLKAVS: An Open Large-Scale Korean Audio-Visual Speech DatasetCode1
Probabilistic Back-ends for Online Speaker Recognition and ClusteringCode1
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language RecognitionCode1
Crossed-Time Delay Neural Network for Speaker RecognitionCode1
Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end modelCode1
Self-supervised Speaker Recognition with Loss-gated LearningCode1
Speaker embeddings by modeling channel-wise correlationsCode1
Speaker Recognition from Raw Waveform with SincNetCode1
Bias in Automated Speaker RecognitionCode1
EfficientTDNN: Efficient Architecture Search for Speaker RecognitionCode1
Exploring Deep Learning for Joint Audio-Visual Lip BiometricsCode1
AutoSpeech: Neural Architecture Search for Speaker RecognitionCode1
AM-MobileNet1D: A Portable Model for Speaker RecognitionCode1
HLT-NUS SUBMISSION FOR 2020 NIST Conversational Telephone Speech SRECode1
TAPLoss: A Temporal Acoustic Parameter Loss for Speech EnhancementCode1
Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddingsCode1
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified