
Speaker Recognition

Speaker Recognition is the task of identifying a person, or confirming a claimed identity, from segments of their speech.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers


Showing 76–100 of 435 papers

Title	Date	Tasks	Status
The OCON model: an old but green solution for distributable supervised classification for acoustic monitoring in smart cities	Oct 5, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample	Sep 24, 2024	Speaker Identification, Speaker Recognition	Unverified
Avengers Assemble: Amalgamation of Non-Semantic Features for Depression Detection	Sep 22, 2024	Depression Detection, Emotion Recognition	Unverified
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models	Sep 21, 2024	DeepFake Detection, Face Swapping	Unverified
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels	Sep 16, 2024	Speaker Recognition, Speaker Verification	Unverified
RoboVox Far Field Speaker Recognition: A Novel Data Augmentation Approach with Pretrained Models	Sep 16, 2024	Data Augmentation, Speaker Recognition	Unverified
Text-To-Speech Synthesis In The Wild	Sep 13, 2024	Benchmarking, Speaker Recognition	Unverified
Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings	Aug 30, 2024	speaker-diarization, Speaker Diarization	Unverified
The VoxCeleb Speaker Recognition Challenge: A Retrospective	Aug 27, 2024	Domain Adaptation, Speaker Recognition	Unverified
Convexity-based Pruning of Speech Representation Models	Aug 16, 2024	Keyword Spotting, Self-Supervised Learning	Unverified
Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation	Aug 1, 2024	Action Detection, Activity Detection	Unverified
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning	Jul 21, 2024	Representation Learning, Self-Supervised Learning	Unverified
Team HYU ASML ROBOVOX SP Cup 2024 System Description	Jul 16, 2024	Data Augmentation, Speaker Recognition	Unverified
Phonetic Richness for Improved Automatic Speaker Verification	Jul 10, 2024	Speaker Recognition, Speaker Verification	Unverified
A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states	Jul 9, 2024	Articles, Classification	Code Available
Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation	Jul 8, 2024	Automatic Speech Recognition, Emotion Recognition	Unverified
We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings	Jul 5, 2024	Speaker Recognition, Speech Synthesis	Unverified
Prosody-Driven Privacy-Preserving Dementia Detection	Jul 3, 2024	Attribute, Diagnostic	Code Available
Open-Source Conversational AI with SpeechBrain 1.0	Jun 29, 2024	Language Modeling, Language Modelling	Unverified
CEC: A Noisy Label Detection Method for Speaker Recognition	Jun 19, 2024	Speaker Recognition, Speaker Verification	Unverified
Challenging margin-based speaker embedding extractors by using the variational information bottleneck	Jun 18, 2024	Speaker Recognition	Unverified
PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation	Jun 10, 2024	Age Estimation, Emotion Recognition	Unverified
The Reasonable Effectiveness of Speaker Embeddings for Violence Detection	Jun 10, 2024	Speaker Recognition	Unverified
Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting	May 30, 2024	Audio Synthesis, Representation Learning	Unverified
Speaker Characterization by means of Attention Pooling	May 7, 2024	Emotion Recognition, Speaker Recognition	Unverified


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	w2v2-aam	EER	1.88	—	Unverified
2	WavLM+ECAPA-TDNN	EER	0.39	—	Unverified
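
The EER (equal error rate) metric in the table above is the operating point at which the false-acceptance rate equals the false-rejection rate; lower is better, and the listed values are presumably percentages. The following is a minimal sketch of how it could be approximated from verification trial scores. The function name `equal_error_rate` and the toy scores are hypothetical, chosen only for illustration, and this is not the evaluation code behind the listed results.

```python
def equal_error_rate(genuine, impostor):
    """Approximate the EER (as a fraction in [0, 1]) from similarity scores.

    genuine:  scores for same-speaker trials (higher = more similar)
    impostor: scores for different-speaker trials
    """
    # Every observed score is a candidate decision threshold.
    thresholds = sorted(set(genuine) | set(impostor))
    best_gap, eer = float("inf"), 1.0
    for t in thresholds:
        # A trial is accepted when its score is >= the threshold.
        far = sum(s >= t for s in impostor) / len(impostor)  # false acceptances
        frr = sum(s < t for s in genuine) / len(genuine)     # false rejections
        gap = abs(far - frr)
        if gap < best_gap:
            # Keep the threshold where the two error rates are closest.
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Toy scores (made up for illustration only).
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.6, 0.3, 0.2, 0.1]
print(f"EER = {100 * equal_error_rate(genuine_scores, impostor_scores):.2f}%")
```

With only a handful of trials the estimate is coarse; real evaluations use many thousands of trials and often interpolate between thresholds.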