SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 5175 of 435 papers

TitleStatusHype
SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker VerificationCode1
Universal Adversarial Perturbations Generative Network for Speaker RecognitionCode1
Toroidal Probabilistic Spherical Discriminant AnalysisCode1
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language RecognitionCode1
PF-Net: Personalized Filter for Speaker Recognition from Raw WaveformCode0
SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition SystemsCode0
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?Code0
SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification SystemCode0
Robust speaker recognition using unsupervised adversarial invarianceCode0
Additive Margin SincNet for Speaker RecognitionCode0
Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data AugmentationCode0
An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic EnvironmentsCode0
Prosody-Driven Privacy-Preserving Dementia DetectionCode0
Pretext Tasks selection for multitask self-supervised speech representation learningCode0
Baselines and Protocols for Household Speaker RecognitionCode0
Private kNN-VC: Interpretable Anonymization of Converted SpeechCode0
A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative statesCode0
Personal VAD: Speaker-Conditioned Voice Activity DetectionCode0
Risk of re-identification for shared clinical speech recordingsCode0
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer EncodersCode0
Inconsistency Ranking-based Noisy Label Detection for High-quality DataCode0
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile HealthCode0
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker RecognitionCode0
Attention-Based Models for Text-Dependent Speaker VerificationCode0
Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networksCode0
Show:102550
← PrevPage 3 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified