SOTAVerified

Speaker Verification

Speaker verification is the verifying the identity of a person from characteristics of the voice.

( Image credit: Contrastive-Predictive-Coding-PyTorch )

Papers

Showing 150 of 746 papers

TitleStatusHype
PaddleSpeech: An Easy-to-Use All-in-One Speech ToolkitCode6
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification BenchmarkCode5
Magnitude-aware Probabilistic Speaker EmbeddingsCode3
SALMONN: Towards Generic Hearing Abilities for Large Language ModelsCode3
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf modelsCode3
Pushing the limits of raw waveform speaker recognitionCode3
Ludwig: a type-based declarative deep learning toolboxCode3
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker VerificationCode3
Singer Identity Representation Learning using Self-Supervised TechniquesCode2
Towards A Unified Conformer Structure: from ASR to ASV TaskCode2
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled ModalityCode2
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERTCode2
Generalized End-to-End Loss for Speaker VerificationCode1
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and EditingCode1
FilterAugment: An Acoustic Environmental Data Augmentation MethodCode1
Extended U-Net for Speaker Verification in Noisy EnvironmentsCode1
Explainable deepfake and spoofing detection: an attack analysis using SHapley Additive exPlanationsCode1
FastAudio: A Learnable Audio Front-End for Spoof Speech DetectionCode1
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw WaveformsCode1
Disentanglement in a GAN for Unconditional Speech SynthesisCode1
Deep multi-metric learning for text-independent speaker verificationCode1
DropClass and DropAdapt: Dropping classes for deep speaker representation learningCode1
End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake DetectionCode1
Evaluation of Speech Representations for MOS predictionCode1
Exploring Binary Classification Loss For Speaker VerificationCode1
ExPO: Explainable Phonetic Trait-Oriented Network for Speaker VerificationCode1
Cross-modal Audio-visual Co-learning for Text-independent Speaker VerificationCode1
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learningCode1
FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With AttentionCode1
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback ConstraintCode1
Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech AttacksCode1
Cross-modal information fusion for voice spoofing detectionCode1
DS-TDNN: Dual-stream Time-delay Neural Network with Global-aware Filter for Speaker VerificationCode1
AutoSpeech: Neural Architecture Search for Speaker RecognitionCode1
Backdoor Attack against Speaker VerificationCode1
Audio Spoofing Verification using Deep Convolutional Neural Networks by Transfer LearningCode1
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment UtterancesCode1
A Fully Tensorized Recurrent Neural NetworkCode1
An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure SystemsCode1
A Probabilistic Fusion Framework for Spoofing Aware Speaker VerificationCode1
Cross-Age Speaker Verification: Learning Age-Invariant Speaker EmbeddingsCode1
Crossed-Time Delay Neural Network for Speaker RecognitionCode1
CryCeleb: A Speaker Verification Dataset Based on Infant Cry SoundsCode1
CTAL: Pre-training Cross-modal Transformer for Audio-and-Language RepresentationsCode1
DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice ConversionCode1
Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic ModelsCode1
Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof DetectionCode1
End-to-end anti-spoofing with RawNet2Code1
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentationCode1
Bias in Automated Speaker RecognitionCode1
Show:102550
← PrevPage 1 of 15Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Multi Task SSLEER1.98Unverified
2ReDimNet-B0-LM (1.0M)EER1.16Unverified
3TitanNet -SEER1.15Unverified
4ReDimNet-B0-LM-ASNorm (1.0M)EER1.07Unverified
5SpeechNASEER1.02Unverified
6ReDimNet-B1-LM (2.2M)EER0.85Unverified
7TitanNet -MEER0.81Unverified
8ReDimNet-B1-LM-ASNorm (2.2M)EER0.73Unverified
9TitanNet -LEER0.68Unverified
10ReDimNet-B2-SF2-LM (4.7M)EER0.57Unverified
#ModelMetricClaimedVerifiedStatus
1Fine-tuned HuBERT LargeEER2.36Unverified
2ReDimNet-B0-LM (1.0M)EER1.16Unverified
3ReDimNet-B0-LM-ASNorm (1.0M)EER1.07Unverified
4SpeechNASEER1.02Unverified
5ReDimNet-B1-LM (2.2M)EER0.85Unverified
6ReDimNet-B1-LM-ASNorm (2.2M)EER0.73Unverified
7ReDimNet-B2-SF2-LM (4.7M)EER0.57Unverified
8ReDimNet-B2-SF2-LM-ASNorm (4.7M)EER0.52Unverified
9ReDimNet-B4-LM (6.3M)EER0.51Unverified
10ReDimNet-B3-LM (3.0M)EER0.5Unverified
#ModelMetricClaimedVerifiedStatus
1GE2ECosine EER3.55Unverified
2Cosine EER2.38Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet with Attention BackendEER10.77Unverified
2X-Vectors with Attention BackendEER10.12Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA-TDNNminDCF0Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.01Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.03Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.02Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.04Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50EER100Unverified