SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 201250 of 435 papers

TitleStatusHype
Probing Deep Speaker Embeddings for Speaker-related Tasks0
Probing the Information Encoded in X-vectors0
QFA2SR: Query-Free Adversarial Transfer Attacks to Speaker Recognition Systems0
Quality Measures for Speaker Verification with Short Utterances0
Query Expansion System for the VoxCeleb Speaker Recognition Challenge 20200
Study on Inter and Intra Speaker Variability in Speaker Recognition0
A Benchmark for Understanding and Generating Dialogue between Characters in Stories0
A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments0
A Comparative Study of Pre-trained Speech and Audio Embeddings for Speech Emotion Recognition0
A comparative study of several parameterizations for speaker recognition0
A comparison of linear and non-linear calibrations for speaker recognition0
A Deep Neural Network for Short-Segment Speaker Recognition0
Adversarial defense for deep speaker recognition using hybrid adversarial training0
Adversarial Speaker Verification0
A Generative Model for Score Normalization in Speaker Recognition0
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion0
A Lightweight Speaker Recognition System Using Timbre Properties0
A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning0
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset0
Analysis of ABC Frontend Audio Systems for the NIST-SRE240
Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition0
Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation0
An Effortless Way To Create Large-Scale Datasets For Famous Speakers0
An Ensemble SVM-based Approach for Voice Activity Detection0
A new Speech Feature Fusion method with cross gate parallel CNN for Speaker Recognition0
An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS0
An improved uncertainty propagation method for robust i-vector based speaker recognition0
An I-vector Based Approach to Compact Multi-Granularity Topic Spaces Representation of Textual Documents0
A Novel Speech Feature Fusion Algorithm for Text-Independent Speaker Recognition0
基於數字文本相關之語者驗證系統的研究與實作 (Study and Implementation on Digit-related Speaker Verification) [In Chinese]0
Arabic Speech Rhythm Corpus: Read and Spontaneous Speaking Styles0
A Reinforcement Learning Framework for Online Speaker Diarization0
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models0
Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech0
AS2T: Arbitrary Source-To-Target Adversarial Attack on Speaker Recognition Systems0
Assurance Monitoring of Learning Enabled Cyber-Physical Systems Using Inductive Conformal Prediction based on Distance Learning0
A Study on Angular Based Embedding Learning for Text-independent Speaker Verification0
A Study on Bias and Fairness In Deep Speaker Recognition0
Asymmetric and trial-dependent modeling: the contribution of LIA to SdSV Challenge Task 20
Attacking Speaker Recognition With Deep Generative Models0
Audio Representation Learning by Distilling Video as Privileged Information0
Audio-to-Image Encoding for Improved Voice Characteristic Detection Using Deep Convolutional Neural Networks0
Audio-visual Speaker Recognition with a Cross-modal Discriminative Network0
Augmentation adversarial training for self-supervised speaker recognition0
A Unified Deep Neural Network for Speaker and Language Recognition0
Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel0
Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection0
Bayesian calibration for forensic evidence reporting0
Black-box Adversarial Attacks on Commercial Speech Platforms with Minimal Information0
Blind score normalization method for PLDA based speaker recognition0
Show:102550
← PrevPage 5 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified