SOTAVerified

Speaker Identification

Papers

Showing 111–120 of 248 papers

Title | Status | Hype
Exploring VQ-VAE with Prosody Parameters for Speaker Anonymization |  | 0
基於聽覺感知模型之類神經網路及其在語者識別上之應用 (Two-stage Attentional Auditory Model Inspired Neural Network and Its Application to Speaker Identification) [In Chinese] |  | 0
Experiments on Open-Set Speaker Identification with Discriminatively Trained Neural Networks |  | 0
Invited Talk: IBM Cognitive Computing - An NLP Renaissance! |  | 0
ExARN: self-attending RNN for target speaker extraction |  | 0
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers |  | 0
Karaoker: Alignment-free singing voice synthesis with speech training data |  | 0
Large-Scale Speaker Diarization of Radio Broadcast Archives |  | 0
Characteristic-Specific Partial Fine-Tuning for Efficient Emotion and Speaker Adaptation in Codec Language Text-to-Speech Models |  | 0
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MSM-MAE | Top-1 (%) | 96.6 |  | Unverified
2 | M2D/0.6 | Top-1 (%) | 96.5 |  | Unverified
3 | M2D/0.7 | Top-1 (%) | 96.3 |  | Unverified
4 | M2D ratio=0.6 | Top-1 (%) | 94.8 |  | Unverified
5 | AudioMAE (local) | Top-1 (%) | 94.8 |  | Unverified
6 | ATST Base (ours) | Top-1 (%) | 94.3 |  | Unverified
7 | AudioMAE (global) | Top-1 (%) | 94.1 |  | Unverified
8 | AutoSpeech (N=8,C=128) | Top-1 (%) | 87.66 |  | Unverified
9 | SSAST-FRAME | Top-1 (%) | 80.8 |  | Unverified
10 | SSAMBA | Top-1 (%) | 70.1 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fuzzy Retrieval | Top-1 (%) | 67.77 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fuzzy Retrieval | Top-1 (%) | 80.83 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Fuzzy Retrieval | Top-1 (%) | 95.13 |  | Unverified