SOTAVerified

Speaker Identification

Papers

Showing 201–225 of 248 papers

Title | Status | Hype
Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features | | 0
Speaker attribution with voice profiles by graph-based semi-supervised learning | | 0
Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones | | 0
Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues | | 0
Speaker Identification Experiments Under Gender De-Identification | | 0
Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG | | 0
Speaker identification from the sound of the human breath | | 0
Speaker Identification From Youtube Obtained Data | | 0
Speaker Identification in each of the Neutral and Shouted Talking Environments based on Gender-Dependent Approach Using SPHMMs | | 0
Speaker Identification using EEG | | 0
Speaker Identification using Speech Recognition | | 0
Speaker Recognition in Bengali Language from Nonlinear Features | | 0
Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition | | 0
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention | | 0
Speech-FT: Merging Pre-trained And Fine-Tuned Speech Representation Models For Cross-Task Generalization | | 0
Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis | | 0
Speech Unlearning | | 0
Speech watermarking: an approach for the forensic analysis of digital telephonic recordings | | 0
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks | | 0
Story Comprehension for Predicting What Happens Next | | 0
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations | | 0
Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks | Code | 0
SIG: Speaker Identification in Literature via Prompt-Based Generation | Code | 0
Deep Learning for Speaker Identification: Architectural Insights from AB-1 Corpus Analysis and Performance Evaluation | Code | 0
Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue | Code | 0
Page 9 of 10

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MSM-MAE | Top-1 (%) | 96.6 | | Unverified
2 | M2D/0.6 | Top-1 (%) | 96.5 | | Unverified
3 | M2D/0.7 | Top-1 (%) | 96.3 | | Unverified
4 | M2D ratio=0.6 | Top-1 (%) | 94.8 | | Unverified
5 | AudioMAE (local) | Top-1 (%) | 94.8 | | Unverified
6 | ATST Base (ours) | Top-1 (%) | 94.3 | | Unverified
7 | AudioMAE (global) | Top-1 (%) | 94.1 | | Unverified
8 | AutoSpeech (N=8,C=128) | Top-1 (%) | 87.66 | | Unverified
9 | SSAST-FRAME | Top-1 (%) | 80.8 | | Unverified
10 | SSAMBA | Top-1 (%) | 70.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fuzzy Retrieval | Top-1 (%) | 67.77 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fuzzy Retrieval | Top-1 (%) | 80.83 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fuzzy Retrieval | Top-1 (%) | 95.13 | | Unverified