Speaker Identification

Papers

Showing 51–100 of 248 papers

Title	Date	Tasks	Status	Score
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers	Oct 22, 2020	speaker-diarization, Speaker Diarization	Code Available	5
Cross-Lingual Speaker Identification Using Distant Supervision	Oct 11, 2022	Language Modeling, Language Modelling	Code Available	5
Contrastive Learning of General-Purpose Audio Representations	Oct 21, 2020	CoLA, Contrastive Learning	Code Available	5
Unsupervised Speech Representation Pooling Using Vector Quantization	Apr 8, 2023	Emotion Recognition, intent-classification	Code Available	5
Towards Making the Most of Dialogue Characteristics for Neural Chat Translation	Sep 2, 2021	Machine Translation, Response Generation	Code Available	5
Towards Speaker Identification with Minimal Dataset and Constrained Resources using 1D-Convolution Neural Network	Nov 22, 2024	Data Augmentation, Speaker Identification	Code Available	5
Word-level Embeddings for Cross-Task Transfer Learning in Speech Processing	Oct 22, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	5
Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification	Sep 9, 2021	Clustering, Few-Shot Learning	Code Available	5
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding	Dec 23, 2024	Speaker Identification	Code Available	5
PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform	May 31, 2021	Speaker Identification, Speaker Recognition	Code Available	5
On Learning Associations of Faces and Voices	May 15, 2018	Speaker Identification	Code Available	5
PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction	Oct 3, 2021	Speaker Identification, Speaker Verification	Code Available	5
CoLMbo: Speaker Language Model for Descriptive Profiling	Jun 11, 2025	Descriptive, Language Modeling	Code Available	5
A Generative Product-of-Filters Model of Audio	Dec 20, 2013	model, Speaker Identification	Code Available	5
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification	Apr 28, 2022	Speaker Identification, Speaker Verification	Code Available	5
Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue	Sep 7, 2024	Question Answering, Speaker Identification	Code Available	5
Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks	Oct 1, 2019	Speaker Identification, Speaker Recognition	Code Available	5
Identifying Speakers in Dialogue Transcripts: A Text-based Approach Using Pretrained Language Models	Jul 16, 2024	Attribute, Speaker Identification	Code Available	5
A domain-agnostic approach for opinion prediction on speech	Dec 1, 2016	Emotion Recognition, Feature Engineering	Code Available	5
Identify Speakers in Cocktail Parties with End-to-End Attention	May 22, 2020	Speaker Identification, Speech Separation	Code Available	5
SIG: Speaker Identification in Literature via Prompt-Based Generation	Dec 22, 2023	Speaker Identification	Code Available	5
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings	May 5, 2021	Clustering, Speaker Identification	Unverified	0
Emirati-Accented Speaker Identification in Stressful Talking Conditions	Sep 28, 2019	Speaker Identification	Unverified	0
Efficiency-oriented approaches for self-supervised speech representation learning	Dec 18, 2023	Automatic Speech Recognition, Representation Learning	Unverified	0
A user study to compare two conversational assistants designed for people with hearing impairments	Jun 1, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods	Feb 26, 2024	Speaker Identification	Unverified	0
Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks	Dec 1, 2016	Dialect Identification, Information Retrieval	Unverified	0
Delving into VoxCeleb: environment invariant speaker recognition	Oct 24, 2019	Speaker Identification, Speaker Recognition	Unverified	0
A Multi Level Data Fusion Approach for Speaker Identification on Telephone Speech	Jun 27, 2014	Speaker Identification	Unverified	0
Advances in Online Audio-Visual Meeting Transcription	Dec 10, 2019	Sound Source Localization, speaker-diarization	Unverified	0
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models	Jul 14, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data	Mar 9, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
DASB -- Discrete Audio and Speech Benchmark	Jun 20, 2024	Benchmarking, Emotion Recognition	Unverified	0
Curie: A method for protecting SVM Classifier from Poisoning Attack	Jun 5, 2016	BIG-bench Machine Learning, Speaker Identification	Unverified	0
A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR	Sep 9, 2024	Automatic Speech Recognition, speaker-diarization	Unverified	0
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings	Jan 6, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
HPP-Voice: A Large-Scale Evaluation of Speech Embeddings for Multi-Phenotypic Classification	May 22, 2025	speaker-diarization, Speaker Diarization	Unverified	0
Cross-Lingual Speaker Identification from Weak Local Evidence	Jan 16, 2022	Language Modeling, Language Modelling	Unverified	0
A Survey on Paralinguistics in Tamil Speech Processing	Apr 1, 2021	Emotion Recognition, Speaker Identification	Unverified	0
Advanced Rich Transcription System for Estonian Speech	Jan 11, 2019	Speaker Identification	Unverified	0
Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors	Oct 25, 2019	Speaker Identification	Unverified	0
How Redundant Is the Transformer Stack in Speech Representation Models?	Sep 10, 2024	Knowledge Distillation, Speaker Identification	Unverified	0
How Far Are We from Robust Voice Conversion: A Survey	Nov 24, 2020	Speaker Identification, Survey	Unverified	0
Cosine similarity-based adversarial process	Jul 1, 2019	Speaker Identification	Unverified	0
Histogram Transform-based Speaker Identification	Aug 2, 2018	Speaker Identification	Unverified	0
H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model	Oct 17, 2019	Speaker Identification	Unverified	0
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones	Mar 13, 2023	Event Detection, Sound Event Detection	Unverified	0
Identification of Speakers in Novels	Aug 1, 2013	Speaker Identification	Unverified	0
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems	Jun 18, 2022	Speaker Identification, Speaker Verification	Unverified	0
A Study of Few-Shot Audio Classification	Dec 2, 2020	Audio Classification, BIG-bench Machine Learning	Unverified	0


Datasets: VoxCeleb1, EVI en-GB, EVI fr-FR, EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified