SOTAVerified

Speaker Verification

Speaker verification is the verifying the identity of a person from characteristics of the voice.

( Image credit: Contrastive-Predictive-Coding-PyTorch )

Papers

Showing 150 of 746 papers

TitleStatusHype
PaddleSpeech: An Easy-to-Use All-in-One Speech ToolkitCode6
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification BenchmarkCode5
Pushing the limits of raw waveform speaker recognitionCode3
Ludwig: a type-based declarative deep learning toolboxCode3
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf modelsCode3
SALMONN: Towards Generic Hearing Abilities for Large Language ModelsCode3
Magnitude-aware Probabilistic Speaker EmbeddingsCode3
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker VerificationCode3
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled ModalityCode2
Towards A Unified Conformer Structure: from ASR to ASV TaskCode2
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERTCode2
Singer Identity Representation Learning using Self-Supervised TechniquesCode2
Generalized End-to-End Loss for Speaker VerificationCode1
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and EditingCode1
FragmentVC: Any-to-Any Voice Conversion by End-to-End Extracting and Fusing Fine-Grained Voice Fragments With AttentionCode1
FastAudio: A Learnable Audio Front-End for Spoof Speech DetectionCode1
ExPO: Explainable Phonetic Trait-Oriented Network for Speaker VerificationCode1
FilterAugment: An Acoustic Environmental Data Augmentation MethodCode1
Improved RawNet with Feature Map Scaling for Text-independent Speaker Verification using Raw WaveformsCode1
End-to-End Spectro-Temporal Graph Attention Networks for Speaker Verification Anti-Spoofing and Speech Deepfake DetectionCode1
Attack on practical speaker verification system using universal adversarial perturbationsCode1
Audio Spoofing Verification using Deep Convolutional Neural Networks by Transfer LearningCode1
Cross-modal Audio-visual Co-learning for Text-independent Speaker VerificationCode1
Exploring Binary Classification Loss For Speaker VerificationCode1
Extended U-Net for Speaker Verification in Noisy EnvironmentsCode1
Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech AttacksCode1
DS-TDNN: Dual-stream Time-delay Neural Network with Global-aware Filter for Speaker VerificationCode1
Crossed-Time Delay Neural Network for Speaker RecognitionCode1
Cross-Age Speaker Verification: Learning Age-Invariant Speaker EmbeddingsCode1
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback ConstraintCode1
Automatic speaker verification spoofing and deepfake detection using wav2vec 2.0 and data augmentationCode1
Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack DetectionCode1
Evaluation of Speech Representations for MOS predictionCode1
Deep multi-metric learning for text-independent speaker verificationCode1
DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice ConversionCode1
An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learningCode1
An Unsupervised Autoregressive Model for Speech Representation LearningCode1
A Fully Tensorized Recurrent Neural NetworkCode1
A Speaker Verification Backend with Robust Performance across ConditionsCode1
ASVspoof 2021: Automatic Speaker Verification Spoofing and Countermeasures Challenge Evaluation PlanCode1
DropClass and DropAdapt: Dropping classes for deep speaker representation learningCode1
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio DetectionCode1
Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic ModelsCode1
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment UtterancesCode1
Efficient Attention Branch Network with Combined Loss Function for Automatic Speaker Verification Spoof DetectionCode1
End-to-end anti-spoofing with RawNet2Code1
Bias in Automated Speaker RecognitionCode1
Backdoor Attack against Speaker VerificationCode1
Bts-e: Audio deepfake detection using breathing-talking-silence encoderCode1
An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure SystemsCode1
Show:102550
← PrevPage 1 of 15Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Multi Task SSLEER1.98Unverified
2ReDimNet-B0-LM (1.0M)EER1.16Unverified
3TitanNet -SEER1.15Unverified
4ReDimNet-B0-LM-ASNorm (1.0M)EER1.07Unverified
5SpeechNASEER1.02Unverified
6ReDimNet-B1-LM (2.2M)EER0.85Unverified
7TitanNet -MEER0.81Unverified
8ReDimNet-B1-LM-ASNorm (2.2M)EER0.73Unverified
9TitanNet -LEER0.68Unverified
10ReDimNet-B2-SF2-LM (4.7M)EER0.57Unverified
#ModelMetricClaimedVerifiedStatus
1Fine-tuned HuBERT LargeEER2.36Unverified
2ReDimNet-B0-LM (1.0M)EER1.16Unverified
3ReDimNet-B0-LM-ASNorm (1.0M)EER1.07Unverified
4SpeechNASEER1.02Unverified
5ReDimNet-B1-LM (2.2M)EER0.85Unverified
6ReDimNet-B1-LM-ASNorm (2.2M)EER0.73Unverified
7ReDimNet-B2-SF2-LM (4.7M)EER0.57Unverified
8ReDimNet-B2-SF2-LM-ASNorm (4.7M)EER0.52Unverified
9ReDimNet-B4-LM (6.3M)EER0.51Unverified
10ReDimNet-B3-LM (3.0M)EER0.5Unverified
#ModelMetricClaimedVerifiedStatus
1GE2ECosine EER3.55Unverified
2Cosine EER2.38Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet with Attention BackendEER10.77Unverified
2X-Vectors with Attention BackendEER10.12Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA-TDNNminDCF0Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.01Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.03Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.02Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.08Unverified
#ModelMetricClaimedVerifiedStatus
1ECAPA2Test EER0.04Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50EER100Unverified