SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 151200 of 435 papers

TitleStatusHype
A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments0
DeepMSRF: A novel Deep Multimodal Speaker Recognition framework with Feature selection0
LEAP System for SRE19 CTS Challenge -- Improvements and Error Analysis0
Deep learning methods in speaker recognition: a review0
Deep Learning for Single and Multi-Session i-Vector Speaker Recognition0
Large-scale learning of generalised representations for speaker recognition0
Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel0
A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition0
Deep factorization for speech signal0
Deep CNN based feature extractor for text-prompted speaker recognition0
A Unified Deep Neural Network for Speaker and Language Recognition0
Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition0
Data augmentation versus noise compensation for x- vector speaker recognition systems in noisy environments0
Augmentation adversarial training for self-supervised speaker recognition0
An Ensemble SVM-based Approach for Voice Activity Detection0
Adversarial Speaker Verification0
Investigation of Using VAE for i-Vector Speaker Verification0
Investigation of Speaker Representation for Target-Speaker Speech Processing0
Investigating the Reasonable Effectiveness of Speaker Pre-Trained Models and their Synergistic Power for SingMOS Prediction0
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective0
Audio-visual Speaker Recognition with a Cross-modal Discriminative Network0
Investigating Prosodic Signatures via Speech Pre-Trained Models for Audio Deepfake Source Attribution0
Introduction to Voice Presentation Attack Detection and Recent Advances0
iQIYI-VID: A Large Dataset for Multi-modal Person Identification0
Introducing Model Inversion Attacks on Automatic Speaker Recognition0
結合I-Vector 及深層神經網路之語者驗證系統 (Text-independent Speaker Verification using a Hybrid I-Vector/DNN Approach) [In Chinese]0
Joint Probabilistic Linear Discriminant Analysis0
Joint Sound Source Separation and Speaker Recognition0
JukeBox: A Multilingual Singer Recognition Dataset0
Fine-grained Early Frequency Attention for Deep Speaker Representation Learning0
KU-ISPL Speaker Recognition Systems under Language mismatch condition for NIST 2016 Speaker Recognition Evaluation0
Language Modelling for Speaker Diarization in Telephonic Interviews0
Interpretable Spectrum Transformation Attacks to Speaker Recognition0
LASPA: Language Agnostic Speaker Disentanglement with Prefix-Tuned Cross-Attention0
Audio-to-Image Encoding for Improved Voice Characteristic Detection Using Deep Convolutional Neural Networks0
LDC Language Resource Database: Building a Bibliographic Database0
An Effortless Way To Create Large-Scale Datasets For Famous Speakers0
Learning Speaker-Invariant Visual Features for Lipreading0
Length- and Noise-aware Training Techniques for Short-utterance Speaker Recognition0
Influence of Mother Tongue on English Accent0
Incorporation of Speech Duration Information in Score Fusion of Speaker Recognition Systems0
Leveraging Speaker Embeddings with Adversarial Multi-task Learning for Age Group Classification0
Likelihood-ratio calibration using prior-weighted proper scoring rules0
Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation0
Cosine Scoring with Uncertainty for Neural Speaker Embedding0
Machine Speech Chain with One-shot Speaker Adaptation0
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach0
Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model0
CopyPaste: An Augmentation Method for Speech Emotion Recognition0
Audio Representation Learning by Distilling Video as Privileged Information0
Show:102550
← PrevPage 4 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified