SOTAVerified

Speaker Recognition

Speaker Recognition is the process of identifying or confirming the identity of a person given his speech segments.

Source: Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition

Papers

Showing 2650 of 435 papers

TitleStatusHype
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models0
Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels0
oboVox Far Field Speaker Recognition: A Novel Data Augmentation Approach with Pretrained Models0
Text-To-Speech Synthesis In The Wild0
USEF-TSE: Universal Speaker Embedding Free Target Speaker ExtractionCode1
Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings0
The VoxCeleb Speaker Recognition Challenge: A Retrospective0
Convexity-based Pruning of Speech Representation Models0
Long-Term Conversation Analysis: Privacy-Utility Trade-off under Noise and Reverberation0
VoxSim: A perceptual voice similarity datasetCode1
Reshape Dimensions Network for Speaker RecognitionCode2
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning0
Team HYU ASML ROBOVOX SP Cup 2024 System Description0
VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification BenchmarkCode5
Phonetic Richness for Improved Automatic Speaker Verification0
A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative statesCode0
Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation0
We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings0
Prosody-Driven Privacy-Preserving Dementia DetectionCode0
Open-Source Conversational AI with SpeechBrain 1.00
CEC: A Noisy Label Detection Method for Speaker Recognition0
Challenging margin-based speaker embedding extractors by using the variational information bottleneck0
PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation0
The Reasonable Effectiveness of Speaker Embeddings for Violence Detection0
Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting0
Show:102550
← PrevPage 2 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1w2v2-aamEER1.88Unverified
2WavLM+ECAPA-TDNNEER0.39Unverified