SOTAVerified

Speaker Identification

Papers

Showing 1–10 of 248 papers

| Title | Status | Hype |
| --- | --- | --- |
| PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | Code | 6 |
| VoxBlink2: A 100K+ Speaker Recognition Corpus and the Open-Set Speaker-Identification Benchmark | Code | 5 |
| SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model | Code | 2 |
| SSAST: Self-Supervised Audio Spectrogram Transformer | Code | 2 |
| audino: A Modern Annotation Tool for Audio and Speech | Code | 2 |
| Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers for Robust Speaker Embeddings | Code | 1 |
| Disentangling Textual and Acoustic Features of Neural Speech Representations | Code | 1 |
| ComiCap: A VLMs pipeline for dense captioning of Comic Panels | Code | 1 |
| CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding | Code | 1 |
| InstructERC: Reforming Emotion Recognition in Conversation with Multi-task Retrieval-Augmented Large Language Models | Code | 1 |

Benchmark Results

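| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MSM-MAE | Top-1 (%) | 96.6 | | Unverified |
| 2 | M2D/0.6 | Top-1 (%) | 96.5 | | Unverified |
| 3 | M2D/0.7 | Top-1 (%) | 96.3 | | Unverified |
| 4 | M2D ratio=0.6 | Top-1 (%) | 94.8 | | Unverified |
| 5 | AudioMAE (local) | Top-1 (%) | 94.8 | | Unverified |
| 6 | ATST Base (ours) | Top-1 (%) | 94.3 | | Unverified |
| 7 | AudioMAE (global) | Top-1 (%) | 94.1 | | Unverified |
| 8 | AutoSpeech (N=8,C=128) | Top-1 (%) | 87.66 | | Unverified |
| 9 | SSAST-FRAME | Top-1 (%) | 80.8 | | Unverified |
| 10 | SSAMBA | Top-1 (%) | 70.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Fuzzy Retrieval | Top-1 (%) | 67.77 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Fuzzy Retrieval | Top-1 (%) | 80.83 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Fuzzy Retrieval | Top-1 (%) | 95.13 | | Unverified |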