Speaker Identification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–225 of 248 papers

Title	Date	Tasks	Status
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models	Jan 24, 2025	Emotion ClassificationSpeaker Identification	—Unverified
Comparison of Gender- and Speaker-adaptive Emotion Recognition	May 1, 2014	AttributeEmotion Classification	—Unverified
Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification	Jul 14, 2017	Speaker IdentificationSpeaker Recognition	—Unverified
Computer-assisted Speaker Diarization: How to Evaluate Human Corrections	May 1, 2018	Active LearningFace Recognition	—Unverified
Computing with Hypervectors for Efficient Speaker Identification	Aug 28, 2022	CPUQuantization	—Unverified
Cosine similarity-based adversarial process	Jul 1, 2019	Speaker Identification	—Unverified
Cross-Lingual Speaker Identification from Weak Local Evidence	Jan 16, 2022	Language ModelingLanguage Modelling	—Unverified
Curie: A method for protecting SVM Classifier from Poisoning Attack	Jun 5, 2016	BIG-bench Machine LearningSpeaker Identification	—Unverified
DASB -- Discrete Audio and Speech Benchmark	Jun 20, 2024	BenchmarkingEmotion Recognition	—Unverified
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data	Mar 9, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models	Jul 14, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Delving into VoxCeleb: environment invariant speaker recognition	Oct 24, 2019	Speaker IdentificationSpeaker Recognition	—Unverified
Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks	Dec 1, 2016	Dialect IdentificationInformation Retrieval	—Unverified
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods	Feb 26, 2024	Speaker Identification	—Unverified
Efficiency-oriented approaches for self-supervised speech representation learning	Dec 18, 2023	Automatic Speech RecognitionRepresentation Learning	—Unverified
Emirati-Accented Speaker Identification in Stressful Talking Conditions	Sep 28, 2019	Speaker Identification	—Unverified
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings	May 5, 2021	ClusteringSpeaker Identification	—Unverified
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis	Oct 16, 2023	Automatic Speech RecognitionDecoder	—Unverified
End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification	Mar 13, 2020	Data AugmentationDenoising	—Unverified
End-to-End Speaker-Attributed ASR with Transformer	Apr 5, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample	Sep 24, 2024	Speaker IdentificationSpeaker Recognition	—Unverified
Ensemble knowledge distillation of self-supervised speech models	Feb 24, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Evaluating Speaker Identity Coding in Self-supervised Models and Humans	Jun 14, 2024	Speaker Identification	—Unverified
Evaluation of Automatic Formant Trackers	May 1, 2018	Speaker IdentificationSpeaker Recognition	—Unverified
ExARN: self-attending RNN for target speaker extraction	Dec 2, 2022	Speaker IdentificationTarget Speaker Extraction	—Unverified

Show:10 25 50

← PrevPage 9 of 10Next →

All datasets VoxCeleb1 EVI en-GB EVI fr-FR EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified