Speaker Identification

Papers

Showing 101–150 of 248 papers

Title	Date	Tasks	Status
Ordered and Binary Speaker Embedding	May 25, 2023	Clustering, Retrieval	Unverified
On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications	May 23, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Security and Privacy Problems in Voice Assistant Applications: A Survey	Apr 19, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Unsupervised Speech Representation Pooling Using Vector Quantization	Apr 8, 2023	Emotion Recognition, intent-classification	Code Available
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones	Mar 13, 2023	Event Detection, Sound Event Detection	Unverified
Ensemble knowledge distillation of self-supervised speech models	Feb 24, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
ExARN: self-attending RNN for target speaker extraction	Dec 2, 2022	Speaker Identification, Target Speaker Extraction	Unverified
Multi-Label Training for Text-Independent Speaker Identification	Nov 14, 2022	Ensemble Learning, Speaker Identification	Unverified
Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples	Nov 10, 2022	De-identification, Speaker Identification	Unverified
Symmetric Saliency-based Adversarial Attack To Speaker Identification	Oct 30, 2022	Adversarial Attack, Decoder	Unverified
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input	Oct 26, 2022	Audio Classification, Audio Tagging	Unverified
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation	Oct 23, 2022	Speaker Identification, Speaker Separation	Unverified
Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG	Oct 23, 2022	Speaker Identification	Unverified
Cross-Lingual Speaker Identification Using Distant Supervision	Oct 11, 2022	Language Modeling, Language Modelling	Code Available
Text Independent Speaker Identification System for Access Control	Sep 26, 2022	Speaker Identification	Unverified
Computing with Hypervectors for Efficient Speaker Identification	Aug 28, 2022	CPU, Quantization	Unverified
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models	Jul 14, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification	Jul 8, 2022	Fairness, Speaker Identification	Unverified
Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones	Jul 1, 2022	speaker-diarization, Speaker Diarization	Unverified
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre	Jun 29, 2022	Disentanglement, Speaker Identification	Unverified
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems	Jun 18, 2022	Speaker Identification, Speaker Verification	Unverified
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations	Jun 16, 2022	Speaker Identification, Speech Extraction	Unverified
Speaker Identification using Speech Recognition	May 29, 2022	Speaker Identification, speech-recognition	Unverified
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information	May 8, 2022	Self-Supervised Learning, Speaker Identification	Unverified
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution	May 6, 2022	Benchmarking, Speaker Identification	Unverified
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification	Apr 28, 2022	Speaker Identification, Speaker Verification	Code Available
Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention	Apr 24, 2022	Audio Classification, Few-Shot Learning	Unverified
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment	Apr 22, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Listen only to me! How well can target speech extraction handle false alarms?	Apr 11, 2022	Speaker Identification, Speaker Verification	Unverified
Karaoker: Alignment-free singing voice synthesis with speech training data	Apr 8, 2022	Singing Voice Synthesis, Speaker Identification	Unverified
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification	Apr 8, 2022	Representation Learning, Speaker Identification	Unverified
Improved Relation Networks for End-to-End Speaker Verification and Identification	Mar 31, 2022	Meta-Learning, Relation	Unverified
NeuraGen-A Low-Resource Neural Network based approach for Gender Classification	Mar 29, 2022	Gender Classification, Speaker Identification	Unverified
Speaker Identification Experiments Under Gender De-Identification	Mar 9, 2022	De-identification, Speaker Identification	Unverified
On the relevance of bandwidth extension for speaker identification	Feb 24, 2022	Bandwidth Extension, Speaker Identification	Unverified
openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer	Feb 24, 2022	Open Set Learning, Speaker Identification	Unverified
Speech watermarking: an approach for the forensic analysis of digital telephonic recordings	Feb 23, 2022	Articles, Speaker Identification	Unverified
Tubes Among Us: Analog Attack on Automatic Speaker Identification	Feb 6, 2022	BIG-bench Machine Learning, Speaker Identification	Unverified
Cross-Lingual Speaker Identification from Weak Local Evidence	Jan 16, 2022	Language Modeling, Language Modelling	Unverified
The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices	Dec 15, 2021	Speaker Identification, Voice Conversion	Unverified
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification	Nov 5, 2021	Speaker Identification, Speech Extraction	Unverified
A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions	Oct 23, 2021	Speaker Identification	Unverified
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR	Oct 7, 2021	Action Detection, Activity Detection	Unverified
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition	Oct 7, 2021	Action Detection, Activity Detection	Unverified
PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction	Oct 3, 2021	Speaker Identification, Speaker Verification	Code Available
Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification	Sep 9, 2021	Clustering, Few-Shot Learning	Code Available
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets	Sep 6, 2021	Speaker Identification	Unverified
Towards Making the Most of Dialogue Characteristics for Neural Chat Translation	Sep 2, 2021	Machine Translation, Response Generation	Code Available
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus	Aug 1, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
A Real-time Speaker Diarization System Based on Spatial Spectrum	Jul 20, 2021	speaker-diarization, Speaker Diarization	Unverified


Datasets: VoxCeleb1, EVI en-GB, EVI fr-FR, EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified