SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 251300 of 435 papers

TitleStatusHype
Building and Evaluation of a Real Room Impulse Response Dataset0
BUT System Description to VoxCeleb Speaker Recognition Challenge 20190
BUT VOiCES 2019 System Description0
Call My Net 2: A New Resource for Speaker Recognition0
Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection0
CEC: A Noisy Label Detection Method for Speaker Recognition0
Centroid-based deep metric learning for speaker recognition0
Challenging margin-based speaker embedding extractors by using the variational information bottleneck0
Channel adversarial training for cross-channel text-independent speaker recognition0
The xx205 System for the VoxCeleb Speaker Recognition Challenge 20200
They are wearing a mask! Identification of Subjects Wearing a Surgical Mask from their Speech by means of x-vectors and Fisher Vectors0
THUEE system description for NIST 2019 SRE CTS Challenge0
THUEE system description for NIST 2020 SRE CTS challenge0
Tied Hidden Factors in Neural Networks for End-to-End Speaker Recognition0
TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches0
Tongji University Undergraduate Team for the VoxCeleb Speaker Recognition Challenge20200
To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition0
Towards End-to-End Private Automatic Speaker Recognition0
Late Audio-Visual Fusion for In-The-Wild Speaker Diarization0
Towards Relevance and Sequence Modeling in Language Recognition0
Transforming the Embeddings: A Lightweight Technique for Speech Emotion Recognition Tasks0
Triplet Based Embedding Distance and Similarity Learning for Text-independent Speaker Verification0
Understanding Contrastive Learning Through the Lens of Margins0
UNISOUND System for VoxCeleb Speaker Recognition Challenge 20230
Universal speaker recognition encoders for different speech segments duration0
UniX-Encoder: A Universal X-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing0
Unsupervised Adaptation of SPLDA0
Unsupervised Learning of Disentangled Speech Content and Style Representation0
以二維共振峰分布建立語者音色模型及其在語者驗證上之應用 (Using 2D Formant Distribution to Build Speaker Models and Its Application in Speaker Verification) [In Chinese]0
UTD-CRSS Systems for 2016 NIST Speaker Recognition Evaluation0
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework0
VAE-based regularization for deep speaker embedding0
Variational Autoencoders with implicit priors for short-duration text-independent speaker verification0
Visual Speech Recognition0
Voice Conversion Augmentation for Speaker Recognition on Defective Datasets0
Voice Morphing: Two Identities in One Voice0
Voice Quality and Pitch Features in Transformer-Based Speech Recognition0
Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices0
VoxBlink: A Large Scale Speaker Verification Dataset on Camera0
VoxSRC 2019: The first VoxCeleb Speaker Recognition Challenge0
VoxSRC 2020: The Second VoxCeleb Speaker Recognition Challenge0
VoxVietnam: a Large-Scale Multi-Genre Dataset for Vietnamese Speaker Recognition0
VoxWatch: An open-set speaker recognition benchmark on VoxCeleb0
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes0
WeCanTalk: A New Multi-language, Multi-modal Resource for Speaker Recognition0
We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings0
What do End-to-End Speech Models Learn about Speaker, Language and Channel Information? A Layer-wise and Neuron-level Analysis0
Who is Authentic Speaker0
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?0
Xi-Vector Embedding for Speaker Recognition0
Show:102550
← PrevPage 6 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified