SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 151175 of 435 papers

TitleStatusHype
Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel0
A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition0
Deep factorization for speech signal0
Deep CNN based feature extractor for text-prompted speaker recognition0
A Unified Deep Neural Network for Speaker and Language Recognition0
Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition0
Data augmentation versus noise compensation for x- vector speaker recognition systems in noisy environments0
Augmentation adversarial training for self-supervised speaker recognition0
An Ensemble SVM-based Approach for Voice Activity Detection0
Adversarial Speaker Verification0
Investigation of Using VAE for i-Vector Speaker Verification0
Cross-modal Speaker Verification and Recognition: A Multilingual Perspective0
Audio-visual Speaker Recognition with a Cross-modal Discriminative Network0
Introduction to Voice Presentation Attack Detection and Recent Advances0
Introducing Model Inversion Attacks on Automatic Speaker Recognition0
Interpretable Spectrum Transformation Attacks to Speaker Recognition0
Audio-to-Image Encoding for Improved Voice Characteristic Detection Using Deep Convolutional Neural Networks0
An Effortless Way To Create Large-Scale Datasets For Famous Speakers0
Influence of Mother Tongue on English Accent0
Investigating Prosodic Signatures via Speech Pre-Trained Models for Audio Deepfake Source Attribution0
Investigating the Reasonable Effectiveness of Speaker Pre-Trained Models and their Synergistic Power for SingMOS Prediction0
Investigation of Speaker Representation for Target-Speaker Speech Processing0
Incorporation of Speech Duration Information in Score Fusion of Speaker Recognition Systems0
iQIYI-VID: A Large Dataset for Multi-modal Person Identification0
Cosine Scoring with Uncertainty for Neural Speaker Embedding0
Show:102550
← PrevPage 7 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified