SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 301350 of 435 papers

TitleStatusHype
To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition0
Towards End-to-End Private Automatic Speaker Recognition0
Late Audio-Visual Fusion for In-The-Wild Speaker Diarization0
Towards Relevance and Sequence Modeling in Language Recognition0
Transforming the Embeddings: A Lightweight Technique for Speech Emotion Recognition Tasks0
Triplet Based Embedding Distance and Similarity Learning for Text-independent Speaker Verification0
Understanding Contrastive Learning Through the Lens of Margins0
UNISOUND System for VoxCeleb Speaker Recognition Challenge 20230
Universal speaker recognition encoders for different speech segments duration0
UniX-Encoder: A Universal X-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing0
Unsupervised Adaptation of SPLDA0
Unsupervised Learning of Disentangled Speech Content and Style Representation0
以二維共振峰分布建立語者音色模型及其在語者驗證上之應用 (Using 2D Formant Distribution to Build Speaker Models and Its Application in Speaker Verification) [In Chinese]0
UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation0
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework0
VAE-based regularization for deep speaker embedding0
Variational Autoencoders with implicit priors for short-duration text-independent speaker verification0
Visual Speech Recognition0
Voice Conversion Augmentation for Speaker Recognition on Defective Datasets0
Voice Morphing: Two Identities in One Voice0
Voice Quality and Pitch Features in Transformer-Based Speech Recognition0
Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices0
VoxBlink: A Large Scale Speaker Verification Dataset on Camera0
VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge0
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge0
VoxVietnam: a Large-Scale Multi-Genre Dataset for Vietnamese Speaker Recognition0
VoxWatch: An open-set speaker recognition benchmark on VoxCeleb0
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes0
WeCanTalk: A New Multi-language, Multi-modal Resource for Speaker Recognition0
We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings0
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis0
Who is Authentic Speaker0
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?0
Xi-Vector Embedding for Speaker Recognition0
XMUSPEECH System for VoxCeleb Speaker Recognition Challenge 20210
x-vectors meet emotions: A study on dependencies between emotion and speaker recognition0
3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization0
以三元組損失微調時延神經網路語者嵌入函數之語者辨識系統(Time Delay Neural Network-based Speaker Embedding Function Fine-tuned with Triplet Loss for Distance-based Speaker Recognition)0
A Benchmark for Understanding and Generating Dialogue between Characters in Stories0
A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments0
A Comparative Study of Pre-trained Speech and Audio Embeddings for Speech Emotion Recognition0
A comparative study of several parameterizations for speaker recognition0
A comparison of linear and non-linear calibrations for speaker recognition0
A Deep Neural Network for Short-Segment Speaker Recognition0
Adversarial defense for deep speaker recognition using hybrid adversarial training0
Adversarial Speaker Verification0
A Generative Model for Score Normalization in Speaker Recognition0
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion0
A Lightweight Speaker Recognition System Using Timbre Properties0
A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning0
Show:102550
← PrevPage 7 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified