SOTAVerified

Speaker Identification

Papers

Showing 51100 of 248 papers

TitleStatusHype
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers0
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models0
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data0
Advanced Rich Transcription System for Estonian Speech0
Integrated Replay Spoofing-aware Text-independent Speaker Verification0
Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks0
DASB -- Discrete Audio and Speech Benchmark0
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods0
Efficiency-oriented approaches for self-supervised speech representation learning0
Emirati-Accented Speaker Identification in Stressful Talking Conditions0
Curie: A method for protecting SVM Classifier from Poisoning Attack0
A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR0
Cross-Lingual Speaker Identification from Weak Local Evidence0
H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model0
HPP-Voice: A Large-Scale Evaluation of Speech Embeddings for Multi-Phenotypic Classification0
A Survey on Paralinguistics in Tamil Speech Processing0
Karaoker: Alignment-free singing voice synthesis with speech training data0
Cosine similarity-based adversarial process0
A Study of Few-Shot Audio Classification0
A Lightweight Speaker Recognition System Using Timbre Properties0
Computing with Hypervectors for Efficient Speaker Identification0
Computer-assisted Speaker Diarization: How to Evaluate Human Corrections0
A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions0
Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition0
From Dialect Gaps to Identity Maps: Tackling Variability in Speaker Verification0
Improved Relation Networks for End-to-End Speaker Verification and Identification0
Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification0
A Real-time Speaker Diarization System Based on Spatial Spectrum0
French Listening Tests for the Assessment of Intelligibility, Quality, and Identity of Body-Conducted Speech Enhancement0
Comparison of Gender- and Speaker-adaptive Emotion Recognition0
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre0
From Benedict Cumberbatch to Sherlock Holmes: Character Identification in TV series without a Script0
Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model0
From Speaker Identification to Affective Analysis: A Multi-Step System for Analyzing Children's Stories0
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition0
A Joint Model for Quotation Attribution and Coreference Resolution0
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets0
Few-Shot Speaker Identification Using Lightweight Prototypical Network with Feature Grouping and Interaction0
Graph-based Label Propagation for Semi-Supervised Speaker Identification0
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification0
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones0
Histogram Transform-based Speaker Identification0
How Far Are We from Robust Voice Conversion: A Survey0
How Redundant Is the Transformer Stack in Speech Representation Models?0
Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention0
Face Recognition with Machine Learning in OpenCV_ Fusion of the results with the Localization Data of an Acoustic Camera for Speaker Identification0
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings0
Identification of Speakers in Novels0
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems0
A Preliminary Exploration with GPT-4o Voice Mode0
Show:102550
← PrevPage 2 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified