Speaker Identification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 248 papers

Title	Date	Tasks	Status
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework	Apr 9, 2024	Audio Classification	—Unverified
Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling	Apr 1, 2024	Speaker IdentificationSpeech Synthesis	—Unverified
Hearing-Loss Compensation Using Deep Neural Networks: A Framework and Results From a Listening Test	Mar 15, 2024	Music ClassificationSpeaker Identification	—Unverified
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement	Mar 3, 2024	Automatic Speech RecognitionKeyword Spotting	—Unverified
Unraveling Adversarial Examples against Speaker Identification -- Techniques for Attack Detection and Victim Model Classification	Feb 29, 2024	Adversarial AttackClassification	—Unverified
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods	Feb 26, 2024	Speaker Identification	—Unverified
Significance of Chirp MFCC as a Feature in Speech and Audio Applications	Feb 19, 2024	Music ClassificationSpeaker Identification	—Unverified
Probing Self-supervised Learning Models with Target Speech Extraction	Feb 17, 2024	Self-Supervised LearningSpeaker Identification	—Unverified
Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis	Feb 11, 2024	RhythmSpeaker Identification	—Unverified
Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models	Jan 23, 2024	Speaker IdentificationSpeaker Recognition	—Unverified
SIG: Speaker Identification in Literature via Prompt-Based Generation	Dec 22, 2023	Speaker Identification	CodeCode Available
Voxceleb-ESP: preliminary experiments detecting Spanish celebrities from their voices	Dec 20, 2023	Speaker IdentificationSpeaker Recognition	—Unverified
Efficiency-oriented approaches for self-supervised speech representation learning	Dec 18, 2023	Automatic Speech RecognitionRepresentation Learning	—Unverified
Privacy-preserving Representation Learning for Speech Understanding	Oct 26, 2023	ClassificationEmotion Recognition	—Unverified
Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition	Oct 17, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis	Oct 16, 2023	Automatic Speech RecognitionDecoder	—Unverified
Test-Time Training for Speech	Sep 19, 2023	parameter-efficient fine-tuningSpeaker Identification	—Unverified
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks	Sep 18, 2023	Keyword SpottingSpeaker Identification	—Unverified
Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction	Sep 7, 2023	Keyword SpottingSelf-Supervised Learning	—Unverified
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification	Aug 22, 2023	Self-Supervised LearningSpeaker Identification	CodeCode Available
Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment	Jul 6, 2023	Speaker Identificationspeech-recognition	CodeCode Available
Read, Look or Listen? What's Needed for Solving a Multimodal Dataset	Jul 6, 2023	Question AnsweringSpeaker Identification	—Unverified
VoxWatch: An open-set speaker recognition benchmark on VoxCeleb	Jun 30, 2023	Speaker IdentificationSpeaker Recognition	—Unverified
Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition	Jun 1, 2023	Meta-LearningSpeaker Identification	—Unverified
Few-Shot Speaker Identification Using Lightweight Prototypical Network with Feature Grouping and Interaction	May 31, 2023	Speaker Identification	—Unverified

Show:10 25 50

← PrevPage 4 of 10Next →

All datasets VoxCeleb1 EVI en-GB EVI fr-FR EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified