SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 51100 of 435 papers

TitleStatusHype
Speaker Characterization by means of Attention Pooling0
Who is Authentic Speaker0
Certification of Speaker Recognition Models to Additive PerturbationsCode0
Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech0
TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches0
Voice Conversion Augmentation for Speaker Recognition on Defective Datasets0
3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and DiarizationCode0
Asymmetric and trial-dependent modeling: the contribution of LIA to SdSV Challenge Task 20
Cosine Scoring with Uncertainty for Neural Speaker Embedding0
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf modelsCode3
Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models0
Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices0
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes0
Phonetic-aware speaker embedding for far-field speaker verification0
Parrot-Trained Adversarial Examples: Pushing the Practicality of Black-Box Audio Attacks against Speaker Recognition Models0
Personalizing Keyword Spotting with Speaker Information0
Detecting Agreement in Multi-party Conversational AI0
Deep Neural Networks for Automatic Speaker Recognition Do Not Learn Supra-Segmental Temporal Features0
UniX-Encoder: A Universal X-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing0
Take the aTrain. Introducing an Interface for the Accessible Transcription of InterviewsCode3
Privacy-oriented manipulation of speaker representations0
Thech. Report: Genuinization of Speech waveform PMF for speaker detection spoofing and countermeasures0
Disentangling Voice and Content with Self-Supervision for Speaker Recognition0
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker RecognitionCode3
SLMIA-SR: Speaker-Level Membership Inference Attacks against Speaker Recognition SystemsCode0
Voice Morphing: Two Identities in One Voice0
UNISOUND System for VoxCeleb Speaker Recognition Challenge 20230
The DKU-MSXF Speaker Verification System for the VoxCeleb Speaker Recognition Challenge 20230
Graph Neural Network Backend for Speaker Recognition0
The ID R&D VoxCeleb Speaker Recognition Challenge 2023 System Description0
ChinaTelecom System Description to VoxCeleb Speaker Recognition Challenge 20230
GIST-AiTeR Speaker Diarization System for VoxCeleb Speaker Recognition Challenge (VoxSRC) 20230
The DKU-MSXF Diarization System for the VoxCeleb Speaker Recognition Challenge 20230
VoxBlink: A Large Scale Speaker Verification Dataset on Camera0
On-Device Speaker Anonymization of Acoustic Embeddings for ASR based onFlexible Location Gradient Reversal Layer0
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation0
Facial Landmark Detection Evaluation on MOBIO Database0
VoxWatch: An open-set speaker recognition benchmark on VoxCeleb0
Understanding Contrastive Learning Through the Lens of Margins0
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?Code1
Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition0
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions0
Transforming the Embeddings: A Lightweight Technique for Speech Emotion Recognition Tasks0
Ordered and Binary Speaker Embedding0
Generalized domain adaptation framework for parametric back-end in speaker recognition0
Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?Code0
QFA2SR: Query-Free Adversarial Transfer Attacks to Speaker Recognition Systems0
Vocal Style Factorization for Effective Speaker Recognition in Affective ScenariosCode0
A Comparative Study of Pre-trained Speech and Audio Embeddings for Speech Emotion Recognition0
The Graph feature fusion technique for speaker recognition based on wav2vec2.0 framework0
Show:102550
← PrevPage 2 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified