Speaker Identification

Papers


Showing 101–150 of 248 papers

Title	Date	Tasks	Status
Ordered and Binary Speaker Embedding	May 25, 2023	Clustering, Retrieval	Unverified
Tubes Among Us: Analog Attack on Automatic Speaker Identification	Feb 6, 2022	BIG-bench Machine Learning, Speaker Identification	Unverified
PolInterviews -- A Dataset of German Politician Public Broadcast Interviews	Jan 8, 2025	Speaker Identification	Unverified
Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models	Jan 23, 2024	Speaker Identification, Speaker Recognition	Unverified
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition	Oct 7, 2021	Action Detection, Activity Detection	Unverified
Pretraining Multi-Speaker Identification for Neural Speaker Diarization	May 30, 2025	speaker-diarization, Speaker Diarization	Unverified
Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion?	Nov 12, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Privacy-preserving Representation Learning for Speech Understanding	Oct 26, 2023	Classification, Emotion Recognition	Unverified
Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples	Nov 10, 2022	De-identification, Speaker Identification	Unverified
Probing Self-supervised Learning Models with Target Speech Extraction	Feb 17, 2024	Self-Supervised Learning, Speaker Identification	Unverified
Progressive Residual Extraction based Pre-training for Speech Representation Learning	Aug 31, 2024	Emotion Recognition, Representation Learning	Unverified
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus	Jun 24, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus	Aug 1, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation	Oct 23, 2022	Speaker Identification, Speaker Separation	Unverified
Quantized Approximate Signal Processing (QASP): Towards Homomorphic Encryption for audio	May 15, 2025	Speaker Identification, speech-recognition	Unverified
Read, Look or Listen? What's Needed for Solving a Multimodal Dataset	Jul 6, 2023	Question Answering, Speaker Identification	Unverified
Reducing audio membership inference attack accuracy to chance: 4 defenses	Oct 31, 2019	Inference Attack, Membership Inference Attack	Unverified
Remarks on Optimal Scores for Speaker Recognition	Oct 10, 2020	Speaker Identification, Speaker Recognition	Unverified
Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling	Apr 1, 2024	Speaker Identification, Speech Synthesis	Unverified
Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems	Jul 9, 2021	Representation Learning, Speaker Identification	Unverified
REWIND: Speech Time Reversal for Enhancing Speaker Representations in Diffusion-based Voice Conversion	May 27, 2025	Disentanglement, Speaker Identification	Unverified
Rhythm Features for Speaker Identification	Jun 7, 2025	Deep Learning, Rhythm	Unverified
Robust Speaker Recognition Using Speech Enhancement And Attention Model	Jan 14, 2020	Speaker Identification, Speaker Recognition	Unverified
SCDiar: a streaming diarization system based on speaker change detection and speech recognition	Jan 28, 2025	Change Detection, speaker-diarization	Unverified
Security and Privacy Problems in Voice Assistant Applications: A Survey	Apr 19, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Seeing Voices and Hearing Faces: Cross-modal biometric matching	Apr 1, 2018	Face Recognition, Speaker Identification	Unverified
Significance of Chirp MFCC as a Feature in Speech and Audio Applications	Feb 19, 2024	Music Classification, Speaker Identification	Unverified
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information	May 8, 2022	Self-Supervised Learning, Speaker Identification	Unverified
基於稀疏表示之語者識別 (Sparse Representation Based Speaker Identification) [In Chinese]	Oct 1, 2014	Dimensionality Reduction, Speaker Identification	Unverified
Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features	May 25, 2020	Action Detection, Activity Detection	Unverified
Speaker attribution with voice profiles by graph-based semi-supervised learning	Feb 6, 2021	Speaker Identification	Unverified
Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones	Jul 1, 2022	speaker-diarization, Speaker Diarization	Unverified
Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues	Apr 21, 2025	Benchmarking, Speaker Identification	Unverified
Speaker Identification Experiments Under Gender De-Identification	Mar 9, 2022	De-identification, Speaker Identification	Unverified
Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG	Oct 23, 2022	Speaker Identification	Unverified
Speaker identification from the sound of the human breath	Dec 1, 2017	Speaker Identification, Speaker Recognition	Unverified
Speaker Identification From Youtube Obtained Data	Nov 11, 2014	parameter estimation, Quantization	Unverified
Speaker Identification in each of the Neutral and Shouted Talking Environments based on Gender-Dependent Approach Using SPHMMs	Jun 29, 2017	Speaker Identification	Unverified
Speaker Identification using EEG	Mar 7, 2020	EEG, Electroencephalogram (EEG)	Unverified
Speaker Identification using Speech Recognition	May 29, 2022	Speaker Identification, speech-recognition	Unverified
Speaker Recognition in Bengali Language from Nonlinear Features	Apr 15, 2020	Speaker Identification, Speaker Recognition	Unverified
Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition	Jun 1, 2023	Meta-Learning, Speaker Identification	Unverified
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention	Feb 14, 2020	Multi-Task Learning, Speaker Identification	Unverified
Speech-FT: Merging Pre-trained And Fine-Tuned Speech Representation Models For Cross-Task Generalization	Feb 18, 2025	Automatic Speech Recognition, Speaker Identification	Unverified
Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis	Feb 11, 2024	Rhythm, Speaker Identification	Unverified
Speech Unlearning	Jun 1, 2025	Adversarial Robustness, Keyword Spotting	Unverified
Speech watermarking: an approach for the forensic analysis of digital telephonic recordings	Feb 23, 2022	Articles, Speaker Identification	Unverified
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks	Sep 18, 2023	Keyword Spotting, Speaker Identification	Unverified
Story Comprehension for Predicting What Happens Next	Sep 1, 2017	Common Sense Reasoning, Natural Language Understanding	Unverified
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations	Jun 16, 2022	Speaker Identification, Speech Extraction	Unverified


Datasets: VoxCeleb1, EVI en-GB, EVI fr-FR, EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified