Speaker Identification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–248 of 248 papers

Title	Date	Tasks	Status
Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features	May 25, 2020	Action DetectionActivity Detection	—Unverified
Speaker attribution with voice profiles by graph-based semi-supervised learning	Feb 6, 2021	Speaker Identification	—Unverified
Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones	Jul 1, 2022	speaker-diarizationSpeaker Diarization	—Unverified
Speaker Fuzzy Fingerprints: Benchmarking Text-Based Identification in Multiparty Dialogues	Apr 21, 2025	BenchmarkingSpeaker Identification	—Unverified
Speaker Identification Experiments Under Gender De-Identification	Mar 9, 2022	De-identificationSpeaker Identification	—Unverified
Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG	Oct 23, 2022	Speaker Identification	—Unverified
Speaker identification from the sound of the human breath	Dec 1, 2017	Speaker IdentificationSpeaker Recognition	—Unverified
Speaker Identification From Youtube Obtained Data	Nov 11, 2014	parameter estimationQuantization	—Unverified
Speaker Identification in each of the Neutral and Shouted Talking Environments based on Gender-Dependent Approach Using SPHMMs	Jun 29, 2017	Speaker Identification	—Unverified
Speaker Identification using EEG	Mar 7, 2020	EEGElectroencephalogram (EEG)	—Unverified
Speaker Identification using Speech Recognition	May 29, 2022	Speaker Identificationspeech-recognition	—Unverified
Speaker Recognition in Bengali Language from Nonlinear Features	Apr 15, 2020	Speaker IdentificationSpeaker Recognition	—Unverified
Meta-Learning Framework for End-to-End Imposter Identification in Unseen Speaker Recognition	Jun 1, 2023	Meta-LearningSpeaker Identification	—Unverified
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention	Feb 14, 2020	Multi-Task LearningSpeaker Identification	—Unverified
Speech-FT: Merging Pre-trained And Fine-Tuned Speech Representation Models For Cross-Task Generalization	Feb 18, 2025	Automatic Speech RecognitionSpeaker Identification	—Unverified
Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis	Feb 11, 2024	RhythmSpeaker Identification	—Unverified
Speech Unlearning	Jun 1, 2025	Adversarial RobustnessKeyword Spotting	—Unverified
Speech watermarking: an approach for the forensic analysis of digital telephonic recordings	Feb 23, 2022	ArticlesSpeaker Identification	—Unverified
Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks	Sep 18, 2023	Keyword SpottingSpeaker Identification	—Unverified
Story Comprehension for Predicting What Happens Next	Sep 1, 2017	Common Sense ReasoningNatural Language Understanding	—Unverified
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations	Jun 16, 2022	Speaker IdentificationSpeech Extraction	—Unverified
Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks	Oct 1, 2019	Speaker IdentificationSpeaker Recognition	CodeCode Available
SIG: Speaker Identification in Literature via Prompt-Based Generation	Dec 22, 2023	Speaker Identification	CodeCode Available
Deep Learning for Speaker Identification: Architectural Insights from AB-1 Corpus Analysis and Performance Evaluation	Aug 13, 2024	Speaker Identification	CodeCode Available
Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue	Sep 7, 2024	Question AnsweringSpeaker Identification	CodeCode Available
Identify Speakers in Cocktail Parties with End-to-End Attention	May 22, 2020	Speaker IdentificationSpeech Separation	CodeCode Available
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification	Apr 28, 2022	Speaker IdentificationSpeaker Verification	CodeCode Available
CoLMbo: Speaker Language Model for Descriptive Profiling	Jun 11, 2025	DescriptiveLanguage Modeling	CodeCode Available
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation	May 18, 2020	Self-Supervised LearningSpeaker Identification	CodeCode Available
Cross-Lingual Speaker Identification Using Distant Supervision	Oct 11, 2022	Language ModelingLanguage Modelling	CodeCode Available
A domain-agnostic approach for opinion prediction on speech	Dec 1, 2016	Emotion RecognitionFeature Engineering	CodeCode Available
Contrastive Learning of General-Purpose Audio Representations	Oct 21, 2020	CoLAContrastive Learning	CodeCode Available
Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models	Jul 16, 2024	AttributeSpeaker Identification	CodeCode Available
On Learning Associations of Faces and Voices	May 15, 2018	Speaker Identification	CodeCode Available
Towards Making the Most of Dialogue Characteristics for Neural Chat Translation	Sep 2, 2021	Machine TranslationResponse Generation	CodeCode Available
Word-level Embeddings for Cross-Task Transfer Learning in Speech Processing	Oct 22, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform	May 31, 2021	Speaker IdentificationSpeaker Recognition	CodeCode Available
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario	Jan 7, 2021	Multi-Task LearningSpeaker Identification	CodeCode Available
Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network	Nov 22, 2024	Data AugmentationSpeaker Identification	CodeCode Available
Gammatonegram Representation for End-to-End Dysarthric Speech Processing Tasks: Speech Recognition, Speaker Identification, and Intelligibility Assessment	Jul 6, 2023	Speaker Identificationspeech-recognition	CodeCode Available
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers	Oct 22, 2020	speaker-diarizationSpeaker Diarization	CodeCode Available
PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction	Oct 3, 2021	Speaker IdentificationSpeaker Verification	CodeCode Available
Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification	Sep 9, 2021	ClusteringFew-Shot Learning	CodeCode Available
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding	Dec 23, 2024	Speaker Identification	CodeCode Available
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification	Aug 22, 2023	Self-Supervised LearningSpeaker Identification	CodeCode Available
Deep Speaker: an End-to-End Neural Speaker Embedding System	May 5, 2017	ClusteringSpeaker Identification	CodeCode Available
A Generative Product-of-Filters Model of Audio	Dec 20, 2013	modelSpeaker Identification	CodeCode Available
Unsupervised Speech Representation Pooling Using Vector Quantization	Apr 8, 2023	Emotion Recognitionintent-classification	CodeCode Available

Show:10 25 50

← PrevPage 5 of 5Next →

All datasets VoxCeleb1 EVI en-GB EVI fr-FR EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified