SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 351375 of 435 papers

TitleStatusHype
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset0
Analysis of ABC Frontend Audio Systems for the NIST-SRE240
Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition0
Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation0
An Effortless Way To Create Large-Scale Datasets For Famous Speakers0
An Ensemble SVM-based Approach for Voice Activity Detection0
A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition0
An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS0
An improved uncertainty propagation method for robust i-vector based speaker recognition0
An I-vector Based Approach to Compact Multi-Granularity Topic Spaces Representation of Textual Documents0
A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition0
基於數字文本相關之語者驗證系統的研究與實作 (Study and Implementation on Digit-related Speaker Verification) [In Chinese]0
Arabic Speech Rhythm Corpus: Read and Spontaneous Speaking Styles0
A Reinforcement Learning Framework for Online Speaker Diarization0
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models0
Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech0
AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition Systems0
Assurance Monitoring of Learning Enabled Cyber-Physical Systems Using Inductive Conformal Prediction based on Distance Learning0
A Study on Angular Based Embedding Learning for Text-independent Speaker Verification0
A Study on Bias and Fairness In Deep Speaker Recognition0
Asymmetric and trial-dependent modeling: the contribution of LIA to SdSV Challenge Task 20
Attacking Speaker Recognition With Deep Generative Models0
Audio Representation Learning by Distilling Video as Privileged Information0
Audio-to-Image Encoding for Improved Voice Characteristic Detection Using Deep Convolutional Neural Networks0
Audio-visual Speaker Recognition with a Cross-modal Discriminative Network0
Show:102550
← PrevPage 15 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified