Speaker Identification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 248 papers

Title	Date	Tasks	Status
Representation Learning to Classify and Detect Adversarial Attacks against Speaker and Speech Recognition Systems	Jul 9, 2021	Representation LearningSpeaker Identification	—Unverified
QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus	Jun 24, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition	Jun 18, 2021	Speaker IdentificationSpeaker Recognition	—Unverified
Graph-based Label Propagation for Semi-Supervised Speaker Identification	Jun 15, 2021	Speaker IdentificationSpeaker Recognition	—Unverified
PF-Net: Personalized Filter for Speaker Recognition from Raw Waveform	May 31, 2021	Speaker IdentificationSpeaker Recognition	CodeCode Available
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings	May 5, 2021	ClusteringSpeaker Identification	—Unverified
Streaming Multi-talker Speech Recognition with Joint Speaker Identification	Apr 5, 2021	Speaker Identificationspeech-recognition	—Unverified
End-to-End Speaker-Attributed ASR with Transformer	Apr 5, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A Survey on Paralinguistics in Tamil Speech Processing	Apr 1, 2021	Emotion RecognitionSpeaker Identification	—Unverified
Voice Privacy with Smart Digital Assistants in Educational Settings	Mar 24, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Triplet loss based embeddings for forensic speaker identification in Spanish	Feb 24, 2021	Speaker IdentificationTriplet	—Unverified
CASA-Based Speaker Identification Using Cascaded GMM-CNN Classifier in Noisy and Emotional Talking Conditions	Feb 11, 2021	Emotion RecognitionSpeaker Identification	—Unverified
Speaker attribution with voice profiles by graph-based semi-supervised learning	Feb 6, 2021	Speaker Identification	—Unverified
Attention-based multi-task learning for speech-enhancement and speaker-identification in multi-speaker dialogue scenario	Jan 7, 2021	Multi-Task LearningSpeaker Identification	CodeCode Available
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings	Jan 6, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A Study of Few-Shot Audio Classification	Dec 2, 2020	Audio ClassificationBIG-bench Machine Learning	—Unverified
How Far Are We from Robust Voice Conversion: A Survey	Nov 24, 2020	Speaker IdentificationSurvey	—Unverified
Multi-Modal Emotion Detection with Transfer Learning	Nov 13, 2020	Speaker IdentificationTransfer Learning	—Unverified
T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model	Oct 29, 2020	Speaker Identification	—Unverified
Compositional embedding models for speaker identification and diarization with simultaneous speech from 2+ speakers	Oct 22, 2020	speaker-diarizationSpeaker Diarization	CodeCode Available
Contrastive Learning of General-Purpose Audio Representations	Oct 21, 2020	CoLAContrastive Learning	CodeCode Available
A Lightweight Speaker Recognition System Using Timbre Properties	Oct 12, 2020	GPUSpeaker Identification	—Unverified
Remarks on Optimal Scores for Speaker Recognition	Oct 10, 2020	Speaker IdentificationSpeaker Recognition	—Unverified
SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems	Jul 13, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers	Jun 19, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Integrated Replay Spoofing-aware Text-independent Speaker Verification	Jun 10, 2020	Multi-Task LearningSpeaker Identification	—Unverified
Speaker and Posture Classification using Instantaneous Intraspeech Breathing Features	May 25, 2020	Action DetectionActivity Detection	—Unverified
Identify Speakers in Cocktail Parties with End-to-End Attention	May 22, 2020	Speaker IdentificationSpeech Separation	CodeCode Available
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation	May 18, 2020	Self-Supervised LearningSpeaker Identification	CodeCode Available
Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification	May 15, 2020	Speaker Identification	—Unverified
Speaker Recognition in Bengali Language from Nonlinear Features	Apr 15, 2020	Speaker IdentificationSpeaker Recognition	—Unverified
End-to-end Recurrent Denoising Autoencoder Embeddings for Speaker Identification	Mar 13, 2020	Data AugmentationDenoising	—Unverified
Deep Neural Networks for Automatic Speech Processing: A Survey from Large Corpora to Limited Data	Mar 9, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Speaker Identification using EEG	Mar 7, 2020	EEGElectroencephalogram (EEG)	—Unverified
Multi-Task Learning with Auxiliary Speaker Identification for Conversational Emotion Recognition	Mar 3, 2020	Emotion Recognition in ConversationMulti-Task Learning	—Unverified
Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention	Feb 14, 2020	Multi-Task LearningSpeaker Identification	—Unverified
Supervised Speaker Embedding De-Mixing in Two-Speaker Environment	Jan 14, 2020	Speaker IdentificationVocal Bursts Valence Prediction	—Unverified
Robust Speaker Recognition Using Speech Enhancement And Attention Model	Jan 14, 2020	Speaker IdentificationSpeaker Recognition	—Unverified
The Deterministic plus Stochastic Model of the Residual Signal and its Applications	Dec 29, 2019	Speaker IdentificationSpeech Synthesis	—Unverified
Advances in Online Audio-Visual Meeting Transcription	Dec 10, 2019	Sound Source Localizationspeaker-diarization	—Unverified
Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion?	Nov 12, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Supervised Initialization of LSTM Networks for Fundamental Frequency Detection in Noisy Speech Signals	Nov 11, 2019	Speaker Identification	—Unverified
Reducing audio membership inference attack accuracy to chance: 4 defenses	Oct 31, 2019	Inference AttackMembership Inference Attack	—Unverified
Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors	Oct 25, 2019	Speaker Identification	—Unverified
Delving into VoxCeleb: environment invariant speaker recognition	Oct 24, 2019	Speaker IdentificationSpeaker Recognition	—Unverified
Word-level Embeddings for Cross-Task Transfer Learning in Speech Processing	Oct 22, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model	Oct 17, 2019	Speaker Identification	—Unverified
Latent space representation for multi-target speaker detection and identification with a sparse dataset using Triplet neural networks	Oct 1, 2019	Speaker IdentificationSpeaker Recognition	CodeCode Available
Emirati-Accented Speaker Identification in Stressful Talking Conditions	Sep 28, 2019	Speaker Identification	—Unverified
Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model	Sep 24, 2019	Speaker IdentificationSpeaker Recognition	—Unverified

Show:10 25 50

← PrevPage 4 of 5Next →

All datasets VoxCeleb1 EVI en-GB EVI fr-FR EVI pl-PL

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	MSM-MAE	Top-1 (%)	96.6	—	Unverified
2	M2D/0.6	Top-1 (%)	96.5	—	Unverified
3	M2D/0.7	Top-1 (%)	96.3	—	Unverified
4	M2D ratio=0.6	Top-1 (%)	94.8	—	Unverified
5	AudioMAE (local)	Top-1 (%)	94.8	—	Unverified
6	ATST Base (ours)	Top-1 (%)	94.3	—	Unverified
7	AudioMAE (global)	Top-1 (%)	94.1	—	Unverified
8	AutoSpeech (N=8,C=128)	Top-1 (%)	87.66	—	Unverified
9	SSAST-FRAME	Top-1 (%)	80.8	—	Unverified
10	SSAMBA	Top-1 (%)	70.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	67.77	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	80.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Fuzzy Retrieval	Top-1 (%)	95.13	—	Unverified