SOTAVerified

Speaker Identification

Papers

Showing 101–150 of 248 papers

Title | Status | Hype
Ordered and Binary Speaker Embedding | - | 0
On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications | - | 0
Security and Privacy Problems in Voice Assistant Applications: A Survey | - | 0
Unsupervised Speech Representation Pooling Using Vector Quantization | Code | 0
HiSSNet: Sound Event Detection and Speaker Identification via Hierarchical Prototypical Networks for Low-Resource Headphones | - | 0
Ensemble knowledge distillation of self-supervised speech models | - | 0
ExARN: self-attending RNN for target speaker extraction | - | 0
Multi-Label Training for Text-Independent Speaker Identification | - | 0
Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples | - | 0
Symmetric Saliency-based Adversarial Attack To Speaker Identification | - | 0
Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input | Code | 0
Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation | - | 0
Speaker Identification from emotional and noisy speech data using learned voice segregation and Speech VGG | - | 0
Cross-Lingual Speaker Identification Using Distant Supervision | Code | 0
Text Independent Speaker Identification System for Access Control | - | 0
Computing with Hypervectors for Efficient Speaker Identification | - | 0
Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models | - | 0
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification | - | 0
Speaker Diarization and Identification from Single-Channel Classroom Audio Recording Using Virtual Microphones | - | 0
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre | - | 0
Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems | - | 0
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations | - | 0
Speaker Identification using Speech Recognition | - | 0
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information | - | 0
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution | - | 0
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification | Code | 0
Few-Shot Speaker Identification Using Depthwise Separable Convolutional Network with Channel Attention | - | 0
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment | - | 0
Listen only to me! How well can target speech extraction handle false alarms? | - | 0
Karaoker: Alignment-free singing voice synthesis with speech training data | - | 0
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification | - | 0
Improved Relation Networks for End-to-End Speaker Verification and Identification | - | 0
NeuraGen-A Low-Resource Neural Network based approach for Gender Classification | - | 0
Speaker Identification Experiments Under Gender De-Identification | - | 0
On the relevance of bandwidth extension for speaker identification | - | 0
openFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer | - | 0
Speech watermarking: an approach for the forensic analysis of digital telephonic recordings | - | 0
Tubes Among Us: Analog Attack on Automatic Speaker Identification | - | 0
Cross-Lingual Speaker Identification from Weak Local Evidence | - | 0
The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices | - | 0
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification | - | 0
A Study of Acoustic Features in Arabic Speaker Identification under Noisy Environmental Conditions | - | 0
Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR | - | 0
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition | - | 0
PL-EESR: Perceptual Loss Based END-TO-END Robust Speaker Representation Extraction | Code | 0
Compositional Clustering: Applications to Multi-Label Object Recognition and Speaker Identification | Code | 0
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets | - | 0
Towards Making the Most of Dialogue Characteristics for Neural Chat Translation | Code | 0
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus | - | 0
A Real-time Speaker Diarization System Based on Spatial Spectrum | - | 0
Page 3 of 5

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MSM-MAE | Top-1 (%) | 96.6 | - | Unverified
2 | M2D/0.6 | Top-1 (%) | 96.5 | - | Unverified
3 | M2D/0.7 | Top-1 (%) | 96.3 | - | Unverified
4 | M2D ratio=0.6 | Top-1 (%) | 94.8 | - | Unverified
5 | AudioMAE (local) | Top-1 (%) | 94.8 | - | Unverified
6 | ATST Base (ours) | Top-1 (%) | 94.3 | - | Unverified
7 | AudioMAE (global) | Top-1 (%) | 94.1 | - | Unverified
8 | AutoSpeech (N=8,C=128) | Top-1 (%) | 87.66 | - | Unverified
9 | SSAST-FRAME | Top-1 (%) | 80.8 | - | Unverified
10 | SSAMBA | Top-1 (%) | 70.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fuzzy Retrieval | Top-1 (%) | 67.77 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fuzzy Retrieval | Top-1 (%) | 80.83 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fuzzy Retrieval | Top-1 (%) | 95.13 | - | Unverified