SOTAVerified

Speaker Identification

Papers

Showing 2650 of 248 papers

TitleStatusHype
A Modulation-Domain Loss for Neural-Network-based Real-time Speech EnhancementCode1
Generative Pre-Training for Speech with Autoregressive Predictive CodingCode1
Whisper Speaker Identification: Leveraging Pre-Trained Multilingual Transformers for Robust Speaker EmbeddingsCode1
Blind Speech Separation and Dereverberation using Neural BeamformingCode1
Disentangling Textual and Acoustic Features of Neural Speech RepresentationsCode1
Extended U-Net for Speaker Verification in Noisy EnvironmentsCode1
GIFT: Graph-Induced Fine-Tuning for Multi-Party Conversation UnderstandingCode1
FastAudio: A Learnable Audio Front-End for Spoof Speech DetectionCode1
FoolHD: Fooling speaker identification by Highly imperceptible adversarial DisturbancesCode1
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeamCode1
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languagesCode1
ComiCap: A VLMs pipeline for dense captioning of Comic PanelsCode1
CoMix: A Comprehensive Benchmark for Multi-Task Comic UnderstandingCode1
Masked Autoencoders that ListenCode1
MelHuBERT: A simplified HuBERT on Mel spectrogramsCode1
ASiT: Local-Global Audio Spectrogram vIsion Transformer for Event ClassificationCode1
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASRCode1
Non-uniform Speaker Disentanglement For Depression Detection From Raw Speech SignalsCode1
Learning Speaker Representations with Mutual InformationCode1
A user study to compare two conversational assistants designed for people with hearing impairments0
Advances in Online Audio-Visual Meeting Transcription0
A Multi Level Data Fusion Approach for Speaker Identification on Telephone Speech0
Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors0
Emirati-Accented Speaker Identification in Stressful Talking Conditions0
Advanced Rich Transcription System for Estonian Speech0
Show:102550
← PrevPage 2 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified