Speaker Identification

Papers

Showing 101–150 of 248 papers

Title	Date	Tasks	Status
基於聽覺感知模型之類神經網路及其在語者識別上之應用 (Two-stage Attentional Auditory Model Inspired Neural Network and Its Application to Speaker Identification) [In Chinese]	Nov 1, 2017	Speaker Identification	Unverified
A Preliminary Exploration with GPT-4o Voice Mode	Feb 14, 2025	Age Classification, Audio Deepfake Detection	Unverified
A Real-time Speaker Diarization System Based on Spatial Spectrum	Jul 20, 2021	speaker-diarization, Speaker Diarization	Unverified
A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions	Oct 23, 2021	Speaker Identification	Unverified
A Study of Few-Shot Audio Classification	Dec 2, 2020	Audio Classification, BIG-bench Machine Learning	Unverified
A Survey on Paralinguistics in Tamil Speech Processing	Apr 1, 2021	Emotion Recognition, Speaker Identification	Unverified
A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR	Sep 9, 2024	Automatic Speech Recognition, speaker-diarization	Unverified
A user study to compare two conversational assistants designed for people with hearing impairments	Jun 1, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification	Nov 5, 2021	Speaker Identification, Speech Extraction	Unverified
Can Musical Emotion Be Quantified With Neural Jitter Or Shimmer? A Novel EEG Based Study With Hindustani Classical Music	Apr 29, 2017	EEG, Electroencephalogram (EEG)	Unverified
CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions	Feb 11, 2021	Emotion Recognition, Speaker Identification	Unverified
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models	Jan 24, 2025	Emotion Classification, Speaker Identification	Unverified
Comparison of Gender- and Speaker-adaptive Emotion Recognition	May 1, 2014	Attribute, Emotion Classification	Unverified
Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification	Jul 14, 2017	Speaker Identification, Speaker Recognition	Unverified
Computer-assisted Speaker Diarization: How to Evaluate Human Corrections	May 1, 2018	Active Learning, Face Recognition	Unverified
Computing with Hypervectors for Efficient Speaker Identification	Aug 28, 2022	CPU, Quantization	Unverified
Cosine similarity-based adversarial process	Jul 1, 2019	Speaker Identification	Unverified
Cross-Lingual Speaker Identification from Weak Local Evidence	Jan 16, 2022	Language Modeling, Language Modelling	Unverified
Curie: A method for protecting SVM Classifier from Poisoning Attack	Jun 5, 2016	BIG-bench Machine Learning, Speaker Identification	Unverified
DASB -- Discrete Audio and Speech Benchmark	Jun 20, 2024	Benchmarking, Emotion Recognition	Unverified
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data	Mar 9, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models	Jul 14, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Delving into VoxCeleb: environment invariant speaker recognition	Oct 24, 2019	Speaker Identification, Speaker Recognition	Unverified
Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks	Dec 1, 2016	Dialect Identification, Information Retrieval	Unverified
Effect of utterance duration and phonetic content on speaker identification using second-order statistical methods	Feb 26, 2024	Speaker Identification	Unverified
Efficiency-oriented approaches for self-supervised speech representation learning	Dec 18, 2023	Automatic Speech Recognition, Representation Learning	Unverified
Emirati-Accented Speaker Identification in Stressful Talking Conditions	Sep 28, 2019	Speaker Identification	Unverified
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings	May 5, 2021	Clustering, Speaker Identification	Unverified
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis	Oct 16, 2023	Automatic Speech Recognition, Decoder	Unverified
End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification	Mar 13, 2020	Data Augmentation, Denoising	Unverified
End-to-End Speaker-Attributed ASR with Transformer	Apr 5, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample	Sep 24, 2024	Speaker Identification, Speaker Recognition	Unverified
Streaming Multi-talker Speech Recognition with Joint Speaker Identification	Apr 5, 2021	Speaker Identification, speech-recognition	Unverified
Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals	Nov 11, 2019	Speaker Identification	Unverified
Many-to-Many Voice Conversion with Out-of-Dataset Speaker Support	Apr 30, 2019	Speaker Identification, Voice Conversion	Unverified
Symmetric Saliency-based Adversarial Attack To Speaker Identification	Oct 30, 2022	Adversarial Attack, Decoder	Unverified
Test-Time Training for Speech	Sep 19, 2023	parameter-efficient fine-tuning, Speaker Identification	Unverified
Text-based Speaker Identification on Multiparty Dialogues Using Multi-document Convolutional Neural Networks	Jul 1, 2017	Speaker Identification, Speech Recognition	Unverified
Text Independent Speaker Identification System for Access Control	Sep 26, 2022	Speaker Identification	Unverified
The Deterministic plus Stochastic Model of the Residual Signal and its Applications	Dec 29, 2019	Speaker Identification, Speech Synthesis	Unverified
The DIRHA simulated corpus	May 1, 2014	Dialogue Management, Distant Speech Recognition	Unverified
The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices	Dec 15, 2021	Speaker Identification, Voice Conversion	Unverified
SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems	Jul 13, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
The RATS Collection: Supporting HLT Research with Degraded Audio Data	May 1, 2014	Action Detection, Activity Detection	Unverified
TIMIT Speaker Profiling: A Comparison of Multi-task learning and Single-task learning Approaches	Apr 18, 2024	Age Estimation, Classification	Unverified
Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications	Nov 20, 2024	Emotion Recognition, Speaker Identification	Unverified
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR	Oct 7, 2021	Action Detection, Activity Detection	Unverified
Triplet loss based embeddings for forensic speaker identification in Spanish	Feb 24, 2021	Speaker Identification, Triplet	Unverified
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model	Oct 29, 2020	Speaker Identification	Unverified
Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction	Sep 7, 2023	Keyword Spotting, Self-Supervised Learning	Unverified

Datasets: VoxCeleb1, EVI en-GB, EVI fr-FR, EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified