SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 101150 of 435 papers

TitleStatusHype
A Study on Bias and Fairness In Deep Speaker Recognition0
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition0
Speaker Recognition in Realistic Scenario Using Multimodal Data0
A Reinforcement Learning Framework for Online Speaker Diarization0
Interpretable Spectrum Transformation Attacks to Speaker Recognition0
VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition ChallengeCode1
Probabilistic Back-ends for Online Speaker Recognition and ClusteringCode1
Speaker and Language Change Detection using Wav2vec2 and Whisper0
TAPLoss: A Temporal Acoustic Parameter Loss for Speech EnhancementCode1
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile HealthCode0
Audio Representation Learning by Distilling Video as Privileged Information0
Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification0
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset0
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description0
OLKAVS: An Open Large-Scale Korean Audio-Visual Speech DatasetCode1
Introducing Model Inversion Attacks on Automatic Speaker Recognition0
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks0
Probing Deep Speaker Embeddings for Speaker-related Tasks0
A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition0
Inconsistency Ranking-based Noisy Label Detection for High-quality DataCode0
A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition0
Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition0
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker RecognitionCode0
Late Audio-Visual Fusion for In-The-Wild Speaker Diarization0
I4U System Description for NIST SRE'20 CTS Challenge0
Disentangled representation learning for multilingual speaker recognition0
Speaker recognition with two-step multi-modal deep cleansingCode1
Universal speaker recognition encoders for different speech segments duration0
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs0
Toroidal Probabilistic Spherical Discriminant AnalysisCode1
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach0
Large-scale learning of generalised representations for speaker recognition0
Risk of re-identification for shared clinical speech recordingsCode0
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning0
THUEE system description for NIST 2020 SRE CTS challenge0
The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 20220
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 20220
The SpeakIn System Description for CNSRC20220
GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge0
The ReturnZero System for VoxCeleb Speaker Recognition Challenge 20220
The Royalflush System for VoxCeleb Speaker Recognition Challenge 20220
A Benchmark for Understanding and Generating Dialogue between Characters in Stories0
Disentangled Speaker Representation Learning via Mutual Information Minimization0
Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition0
Perception-Aware Attack: Creating Adversarial Music via Reverse-Engineering Human Perception0
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification0
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion0
Towards End-to-End Private Automatic Speaker Recognition0
Towards Understanding and Mitigating Audio Adversarial Examples for Speaker RecognitionCode1
AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition Systems0
Show:102550
← PrevPage 3 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified