SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 151200 of 435 papers

TitleStatusHype
Speaker and Language Change Detection using Wav2vec2 and Whisper0
Masking Kernel for Learning Energy-Efficient Representations for Speaker Recognition and Mobile HealthCode0
Audio Representation Learning by Distilling Video as Privileged Information0
Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification0
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset0
The Newsbridge -Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description0
Introducing Model Inversion Attacks on Automatic Speaker Recognition0
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks0
Probing Deep Speaker Embeddings for Speaker-related Tasks0
A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition0
Inconsistency Ranking-based Noisy Label Detection for High-quality DataCode0
A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition0
Multi-source Domain Adaptation for Text-independent Forensic Speaker Recognition0
Is Style All You Need? Dependencies Between Emotion and GST-based Speaker RecognitionCode0
Late Audio-Visual Fusion for In-The-Wild Speaker Diarization0
I4U System Description for NIST SRE'20 CTS Challenge0
Disentangled representation learning for multilingual speaker recognition0
Universal speaker recognition encoders for different speech segments duration0
Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs0
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach0
Large-scale learning of generalised representations for speaker recognition0
Risk of re-identification for shared clinical speech recordingsCode0
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning0
THUEE system description for NIST 2020 SRE CTS challenge0
The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 20220
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 20220
The SpeakIn System Description for CNSRC20220
The ReturnZero System for VoxCeleb Speaker Recognition Challenge 20220
GIST-AiTeR System for the Diarization Task of the 2022 VoxCeleb Speaker Recognition Challenge0
The Royalflush System for VoxCeleb Speaker Recognition Challenge 20220
A Benchmark for Understanding and Generating Dialogue between Characters in Stories0
Disentangled Speaker Representation Learning via Mutual Information Minimization0
Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition0
Perception-Aware Attack: Creating Adversarial Music via Reverse-Engineering Human Perception0
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification0
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion0
Towards End-to-End Private Automatic Speaker Recognition0
AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition Systems0
WeCanTalk: A New Multi-language, Multi-modal Resource for Speaker Recognition0
Far-Field Speaker Recognition Benchmark Derived From The DiPCo Corpus0
Dynamic Recognition of Speakers for Consent Management by Contrastive Embedding Replay0
Baselines and Protocols for Household Speaker RecognitionCode0
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?0
Graph Convolutional Network Based Semi-Supervised Learning on Multi-Speaker Meeting Data0
The 2021 NIST Speaker Recognition Evaluation0
The NIST CTS Speaker Recognition Challenge0
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective0
Robust Speaker Recognition with Transformers Using wav2vec 2.00
Curriculum learning for self-supervised speaker verification0
To train or not to train adversarially: A study of bias mitigation strategies for speaker recognitionCode0
Show:102550
← PrevPage 4 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified