SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 101125 of 435 papers

TitleStatusHype
A Study on Bias and Fairness In Deep Speaker Recognition0
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition0
Speaker Recognition in Realistic Scenario Using Multimodal Data0
A Reinforcement Learning Framework for Online Speaker Diarization0
Interpretable Spectrum Transformation Attacks to Speaker Recognition0
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition ChallengeCode1
Probabilistic Back-ends for Online Speaker Recognition and ClusteringCode1
Speaker and Language Change Detection using Wav2vec2 and Whisper0
TAPLoss: A Temporal Acoustic Parameter Loss for Speech EnhancementCode1
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile HealthCode0
Audio Representation Learning by Distilling Video as Privileged Information0
Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification0
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset0
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description0
OLKAVS: An Open Large-Scale Korean Audio-Visual Speech DatasetCode1
Introducing Model Inversion Attacks on Automatic Speaker Recognition0
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks0
Probing Deep Speaker Embeddings for Speaker-related Tasks0
A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition0
Inconsistency Ranking-based Noisy Label Detection for High-quality DataCode0
A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition0
Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition0
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker RecognitionCode0
Late Audio-Visual Fusion for In-The-Wild Speaker Diarization0
I4U System Description for NIST SRE'20 CTS Challenge0
Show:102550
← PrevPage 5 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified