Speaker Identification

Papers

Showing 101–125 of 248 papers

Title	Date	Tasks	Status
Ordered and Binary Speaker Embedding	May 25, 2023	Clustering, Retrieval	Unverified
On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications	May 23, 2023	Automatic Speech Recognition (ASR)	Unverified
Security and Privacy Problems in Voice Assistant Applications: A Survey	Apr 19, 2023	Automatic Speech Recognition (ASR)	Unverified
Unsupervised Speech Representation Pooling Using Vector Quantization	Apr 8, 2023	Emotion Recognition, Intent Classification	Code Available
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones	Mar 13, 2023	Event Detection, Sound Event Detection	Unverified
Ensemble knowledge distillation of self-supervised speech models	Feb 24, 2023	Automatic Speech Recognition (ASR)	Unverified
ExARN: self-attending RNN for target speaker extraction	Dec 2, 2022	Speaker Identification, Target Speaker Extraction	Unverified
Multi-Label Training for Text-Independent Speaker Identification	Nov 14, 2022	Ensemble Learning, Speaker Identification	Unverified
Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples	Nov 10, 2022	De-identification, Speaker Identification	Unverified
Symmetric Saliency-based Adversarial Attack To Speaker Identification	Oct 30, 2022	Adversarial Attack, Decoder	Unverified
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input	Oct 26, 2022	Audio Classification, Audio Tagging	Unverified
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation	Oct 23, 2022	Speaker Identification, Speaker Separation	Unverified
Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG	Oct 23, 2022	Speaker Identification	Unverified
Cross-Lingual Speaker Identification Using Distant Supervision	Oct 11, 2022	Language Modeling	Code Available
Text Independent Speaker Identification System for Access Control	Sep 26, 2022	Speaker Identification	Unverified
Computing with Hypervectors for Efficient Speaker Identification	Aug 28, 2022	CPU, Quantization	Unverified
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models	Jul 14, 2022	Automatic Speech Recognition (ASR)	Unverified
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification	Jul 8, 2022	Fairness, Speaker Identification	Unverified
Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones	Jul 1, 2022	Speaker Diarization	Unverified
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre	Jun 29, 2022	Disentanglement, Speaker Identification	Unverified
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems	Jun 18, 2022	Speaker Identification, Speaker Verification	Unverified
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations	Jun 16, 2022	Speaker Identification, Speech Extraction	Unverified
Speaker Identification using Speech Recognition	May 29, 2022	Speaker Identification, Speech Recognition	Unverified
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information	May 8, 2022	Self-Supervised Learning, Speaker Identification	Unverified
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution	May 6, 2022	Benchmarking, Speaker Identification	Unverified

Datasets: VoxCeleb1, EVI en-GB, EVI fr-FR, EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified