SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 401435 of 435 papers

TitleStatusHype
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker RecognitionCode0
Robust speaker recognition using unsupervised adversarial invarianceCode0
An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic EnvironmentsCode0
Pretext Tasks selection for multitask self-supervised speech representation learningCode0
Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data AugmentationCode0
Personal VAD: Speaker-Conditioned Voice Activity DetectionCode0
Inconsistency Ranking-based Noisy Label Detection for High-quality DataCode0
Conditional independence for pretext task selection in Self-supervised speech representation learningCode0
Improving fairness in speaker verification via Group-adapted Fusion NetworkCode0
CoLMbo: Speaker Language Model for Descriptive ProfilingCode0
Who is Real Bob? Adversarial Attacks on Speaker Recognition SystemsCode0
Filterbank design for end-to-end speech separationCode0
DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech SynthesisCode0
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer EncodersCode0
Deep Speaker Vector Normalization with Maximum Gaussianality TrainingCode0
Deep Speaker: an End-to-End Neural Speaker Embedding SystemCode0
U-vectors: Generating clusterable speaker embedding from unlabeled dataCode0
A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative statesCode0
Unified Hypersphere Embedding for Speaker RecognitionCode0
VoxCeleb2: Deep Speaker RecognitionCode0
CN-CELEB: a challenging Chinese speaker recognition datasetCode0
Version Control of Speaker Recognition SystemsCode0
SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition SystemsCode0
Three-Dimensional Lip Motion Network for Text-Independent Speaker RecognitionCode0
Attention-Based Models for Text-Dependent Speaker VerificationCode0
Vocal Style Factorization for Effective Speaker Recognition in Affective ScenariosCode0
Additive Margin SincNet for Speaker RecognitionCode0
Masked Proxy Loss For Text-Independent Speaker VerificationCode0
Certification of Speaker Recognition Models to Additive PerturbationsCode0
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?Code0
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile HealthCode0
PF-Net: Personalized Filter for Speaker Recognition from Raw WaveformCode0
Use of speaker recognition approaches for learning and evaluating embedding representations of musical instrument soundsCode0
Excitement Surfeited Turns to Errors: Deep Learning Testing Framework Based on Excitable NeuronsCode0
Deep Normalization for Speaker VectorsCode0
Show:102550
← PrevPage 9 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified