SOTAVerified

Speaker Recognition

Speaker recognition is the process of identifying or confirming a person's identity from segments of their speech.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 226-250 of 435 papers

| Title | Status | Hype |
| --- | --- | --- |
| An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS | | 0 |
| An improved uncertainty propagation method for robust i-vector based speaker recognition | | 0 |
| An I-vector Based Approach to Compact Multi-Granularity Topic Spaces Representation of Textual Documents | | 0 |
| A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition | | 0 |
| 基於數字文本相關之語者驗證系統的研究與實作 (Study and Implementation on Digit-related Speaker Verification) [In Chinese] | | 0 |
| Arabic Speech Rhythm Corpus: Read and Spontaneous Speaking Styles | | 0 |
| A Reinforcement Learning Framework for Online Speaker Diarization | | 0 |
| Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models | | 0 |
| Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech | | 0 |
| AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition Systems | | 0 |
| Assurance Monitoring of Learning Enabled Cyber-Physical Systems Using Inductive Conformal Prediction based on Distance Learning | | 0 |
| A Study on Angular Based Embedding Learning for Text-independent Speaker Verification | | 0 |
| A Study on Bias and Fairness In Deep Speaker Recognition | | 0 |
| Asymmetric and trial-dependent modeling: the contribution of LIA to SdSV Challenge Task 2 | | 0 |
| Attacking Speaker Recognition With Deep Generative Models | | 0 |
| Audio Representation Learning by Distilling Video as Privileged Information | | 0 |
| Audio-to-Image Encoding for Improved Voice Characteristic Detection Using Deep Convolutional Neural Networks | | 0 |
| Audio-visual Speaker Recognition with a Cross-modal Discriminative Network | | 0 |
| Augmentation adversarial training for self-supervised speaker recognition | | 0 |
| A Unified Deep Neural Network for Speaker and Language Recognition | | 0 |
| Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel | | 0 |
| Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection | | 0 |
| Bayesian calibration for forensic evidence reporting | | 0 |
| Black-box Adversarial Attacks on Commercial Speech Platforms with Minimal Information | | 0 |
| Blind score normalization method for PLDA based speaker recognition | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | w2v2-aam | EER | 1.88 | | Unverified |
| 2 | WavLM+ECAPA-TDNN | EER | 0.39 | | Unverified |
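
The benchmark metric, EER (equal error rate), is the error rate at the decision threshold where the false-accept rate and the false-reject rate are equal; lower is better. The listing does not state a unit, though EER is commonly reported as a percentage. The following is a minimal sketch of one way to approximate it from verification scores, assuming higher scores mean "same speaker". The function name `equal_error_rate` and the scores are hypothetical and for illustration only; this is not the evaluation code behind the numbers listed here.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping thresholds over the observed scores."""
    if not target_scores or not nontarget_scores:
        raise ValueError("need at least one target and one non-target score")
    best_gap, eer = float("inf"), None
    # Candidate thresholds: every distinct observed score.
    for threshold in sorted(set(target_scores) | set(nontarget_scores)):
        # False reject: a same-speaker (target) trial scored below the threshold.
        frr = sum(s < threshold for s in target_scores) / len(target_scores)
        # False accept: a different-speaker (non-target) trial scored at or above it.
        far = sum(s >= threshold for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            # Keep the point where the two error rates are closest.
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Made-up scores for illustration only (not taken from any benchmark).
targets = [0.9, 0.8, 0.7, 0.4]
nontargets = [0.6, 0.3, 0.2, 0.1]
eer_percent = 100 * equal_error_rate(targets, nontargets)
```

With finitely many trials the two error rates rarely cross exactly, so this sketch averages them at the closest point; evaluation toolkits often interpolate along the ROC or DET curve instead, which can give slightly different values.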