SOTAVerified

Speaker Identification

Papers

Showing 101125 of 248 papers

TitleStatusHype
Ordered and Binary Speaker Embedding0
On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications0
Security and Privacy Problems in Voice Assistant Applications: A Survey0
Unsupervised Speech Representation Pooling Using Vector QuantizationCode0
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones0
Ensemble knowledge distillation of self-supervised speech models0
ExARN: self-attending RNN for target speaker extraction0
Multi-Label Training for Text-Independent Speaker Identification0
Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples0
Symmetric Saliency-based Adversarial Attack To Speaker Identification0
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the InputCode0
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation0
Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG0
Cross-Lingual Speaker Identification Using Distant SupervisionCode0
Text Independent Speaker Identification System for Access Control0
Computing with Hypervectors for Efficient Speaker Identification0
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models0
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification0
Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones0
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre0
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems0
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations0
Speaker Identification using Speech Recognition0
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information0
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution0
Show:102550
← PrevPage 5 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified