Speaker Identification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 126–150 of 248 papers

Title	Date	Tasks	Status
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification	Apr 28, 2022	Speaker IdentificationSpeaker Verification	CodeCode Available
Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention	Apr 24, 2022	Audio ClassificationFew-Shot Learning	—Unverified
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment	Apr 22, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Listen only to me! How well can target speech extraction handle false alarms?	Apr 11, 2022	Speaker IdentificationSpeaker Verification	—Unverified
Karaoker: Alignment-free singing voice synthesis with speech training data	Apr 8, 2022	Singing Voice SynthesisSpeaker Identification	—Unverified
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification	Apr 8, 2022	Representation LearningSpeaker Identification	—Unverified
Improved Relation Networks for End-to-End Speaker Verification and Identification	Mar 31, 2022	Meta-LearningRelation	—Unverified
NeuraGen-A Low-Resource Neural Network based approach for Gender Classification	Mar 29, 2022	Gender ClassificationSpeaker Identification	—Unverified
Speaker Identification Experiments Under Gender De-Identification	Mar 9, 2022	De-identificationSpeaker Identification	—Unverified
On the relevance of bandwidth extension for speaker identification	Feb 24, 2022	Bandwidth ExtensionSpeaker Identification	—Unverified
openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer	Feb 24, 2022	Open Set LearningSpeaker Identification	—Unverified
Speech watermarking: an approach for the forensic analysis of digital telephonic recordings	Feb 23, 2022	ArticlesSpeaker Identification	—Unverified
Tubes Among Us: Analog Attack on Automatic Speaker Identification	Feb 6, 2022	BIG-bench Machine LearningSpeaker Identification	—Unverified
Cross-Lingual Speaker Identification from Weak Local Evidence	Jan 16, 2022	Language ModelingLanguage Modelling	—Unverified
The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices	Dec 15, 2021	Speaker IdentificationVoice Conversion	—Unverified
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification	Nov 5, 2021	Speaker IdentificationSpeech Extraction	—Unverified
A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions	Oct 23, 2021	Speaker Identification	—Unverified
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR	Oct 7, 2021	Action DetectionActivity Detection	—Unverified
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition	Oct 7, 2021	Action DetectionActivity Detection	—Unverified
PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction	Oct 3, 2021	Speaker IdentificationSpeaker Verification	CodeCode Available
Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification	Sep 9, 2021	ClusteringFew-Shot Learning	CodeCode Available
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets	Sep 6, 2021	Speaker Identification	—Unverified
Towards Making the Most of Dialogue Characteristics for Neural Chat Translation	Sep 2, 2021	Machine TranslationResponse Generation	CodeCode Available
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus	Aug 1, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A Real-time Speaker Diarization System Based on Spatial Spectrum	Jul 20, 2021	speaker-diarizationSpeaker Diarization	—Unverified

Show:10 25 50

← PrevPage 6 of 10Next →

All datasets VoxCeleb1 EVI en-GB EVI fr-FR EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified