SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 51100 of 435 papers

TitleStatusHype
Bias in Automated Speaker RecognitionCode1
TERA: Self-Supervised Learning of Transformer Encoder Representation for SpeechCode1
Toroidal Probabilistic Spherical Discriminant AnalysisCode1
Crossed-Time Delay Neural Network for Speaker RecognitionCode1
Version Control of Speaker Recognition SystemsCode0
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?Code0
U-vectors: Generating clusterable speaker embedding from unlabeled dataCode0
Additive Margin SincNet for Speaker RecognitionCode0
Unified Hypersphere Embedding for Speaker RecognitionCode0
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument soundsCode0
VoxCeleb2: Deep Speaker RecognitionCode0
Vocal Style Factorization for Effective Speaker Recognition in Affective ScenariosCode0
Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data AugmentationCode0
An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic EnvironmentsCode0
Three-Dimensional Lip Motion Network for Text-Independent Speaker RecognitionCode0
To train or not to train adversarially: A study of bias mitigation strategies for speaker recognitionCode0
Excitement Surfeited Turns to Errors: Deep Learning Testing Framework Based on Excitable NeuronsCode0
Baselines and Protocols for Household Speaker RecognitionCode0
A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative statesCode0
Deep Normalization for Speaker VectorsCode0
SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition SystemsCode0
Deep generative LDACode0
SpeakerStew: Scaling to Many Languages with a Triaged Multilingual Text-Dependent and Text-Independent Speaker Verification SystemCode0
Risk of re-identification for shared clinical speech recordingsCode0
PF-Net: Personalized Filter for Speaker Recognition from Raw WaveformCode0
Private kNN-VC: Interpretable Anonymization of Converted SpeechCode0
COVID-19 Patient Detection from Telephone Quality Speech DataCode0
Pretext Tasks selection for multitask self-supervised speech representation learningCode0
Attention-Based Models for Text-Dependent Speaker VerificationCode0
Inconsistency Ranking-based Noisy Label Detection for High-quality DataCode0
Personal VAD: Speaker-Conditioned Voice Activity DetectionCode0
Prosody-Driven Privacy-Preserving Dementia DetectionCode0
Masked Proxy Loss For Text-Independent Speaker VerificationCode0
Conditional independence for pretext task selection in Self-supervised speech representation learningCode0
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile HealthCode0
Curricular SincNet: Towards Robust Deep Speaker Recognition by Emphasizing Hard Samples in Latent SpaceCode0
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker RecognitionCode0
Improving fairness in speaker verification via Group-adapted Fusion NetworkCode0
Robust speaker recognition using unsupervised adversarial invarianceCode0
Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networksCode0
CoLMbo: Speaker Language Model for Descriptive ProfilingCode0
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer EncodersCode0
CN-CELEB: a challenging Chinese speaker recognition datasetCode0
Filterbank design for end-to-end speech separationCode0
3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and DiarizationCode0
DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech SynthesisCode0
Deep Speaker Vector Normalization with Maximum Gaussianality TrainingCode0
Delving into VoxCeleb: environment invariant speaker recognitionCode0
Certification of Speaker Recognition Models to Additive PerturbationsCode0
Deep Speaker: an End-to-End Neural Speaker Embedding SystemCode0
Show:102550
← PrevPage 2 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified