SOTAVerified

Speaker Identification

Papers

Showing 201248 of 248 papers

TitleStatusHype
A user study to compare two conversational assistants designed for people with hearing impairments0
Many-to-Many Voice Conversion with Out-of-Dataset Speaker Support0
Experiments on Open-Set Speaker Identification with Discriminatively Trained Neural Networks0
Advanced Rich Transcription System for Estonian Speech0
Learning Speaker Representations with Mutual InformationCode1
Histogram Transform-based Speaker Identification0
Speaker Recognition from Raw Waveform with SincNetCode1
Weakly Supervised Training of Speaker Identification Models0
On Learning Associations of Faces and VoicesCode0
VAST: A Corpus of Video Annotation for Speech Technologies0
Evaluation of Automatic Formant Trackers0
Computer-assisted Speaker Diarization: How to Evaluate Human Corrections0
Identifying Speakers and Addressees in Dialogues Extracted from Literary Fiction0
Matics Software Suite: New Tools for Evaluation and Data Exploration0
Seeing Voices and Hearing Faces: Cross-modal biometric matching0
Neural Predictive Coding using Convolutional Neural Networks towards Unsupervised Learning of Speaker Characteristics0
From Benedict Cumberbatch to Sherlock Holmes: Character Identification in TV series without a Script0
Speaker identification from the sound of the human breath0
基於聽覺感知模型之類神經網路及其在語者識別上之應用 (Two-stage Attentional Auditory Model Inspired Neural Network and Its Application to Speaker Identification) [In Chinese]0
Identifying Speakers and Listeners of Quoted Speech in Literary Works0
Story Comprehension for Predicting What Happens Next0
Comparison of Multiple Features and Modeling Methods for Text-dependent Speaker Verification0
Face Recognition with Machine Learning in OpenCV_ Fusion of the results with the Localization Data of an Acoustic Camera for Speaker Identification0
Text-based Speaker Identification on Multiparty Dialogues Using Multi-document Convolutional Neural Networks0
Speaker Identification in each of the Neutral and Shouted Talking Environments based on Gender-Dependent Approach Using SPHMMs0
Deep Speaker: an End-to-End Neural Speaker Embedding SystemCode0
Can Musical Emotion Be Quantified With Neural Jitter Or Shimmer? A Novel EEG Based Study With Hindustani Classical Music0
An Unsupervised Speaker Clustering Technique based on SOM and I-vectors for Speech Recognition Systems0
Discrimination between Similar Languages, Varieties and Dialects using CNN- and LSTM-based Deep Neural Networks0
A domain-agnostic approach for opinion prediction on speechCode0
Monaural Multi-Talker Speech Recognition using Factorial Speech Processing Models0
Curie: A method for protecting SVM Classifier from Poisoning Attack0
Look, Listen and Learn - A Multimodal LSTM for Speaker Identification0
A Novel Minimum Divergence Approach to Robust Speaker Identification0
Speaker Identification From Youtube Obtained Data0
基於稀疏表示之語者識別 (Sparse Representation Based Speaker Identification) [In Chinese]0
Invited Talk: IBM Cognitive Computing - An NLP Renaissance!0
On the Use of Different Feature Extraction Methods for Linear and Non Linear kernels0
A Multi Level Data Fusion Approach for Speaker Identification on Telephone Speech0
Comparison of Gender- and Speaker-adaptive Emotion Recognition0
The DIRHA simulated corpus0
The RATS Collection: Supporting HLT Research with Degraded Audio Data0
From Speaker Identification to Affective Analysis: A Multi-Step System for Analyzing Children's Stories0
A Joint Model for Quotation Attribution and Coreference Resolution0
A Generative Product-of-Filters Model of AudioCode0
Identification of Speakers in Novels0
MKPLS: Manifold Kernel Partial Least Squares for Lipreading and Speaker Identification0
L'identification du locuteur : 20 ans de t\'emoignage dans les cours de Justice. Le cas du LIPSADON laboratoire ind\'ependant de police scientifique (Forensic speaker identification: 20 years of scientific testimonies in courts of Justice. The case of LIPSADON ``forensics independent laboratory'') [in French]0
Show:102550
← PrevPage 5 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MSM-MAETop-1 (%)96.6Unverified
2M2D/0.6Top-1 (%)96.5Unverified
3M2D/0.7Top-1 (%)96.3Unverified
4M2D ratio=0.6Top-1 (%)94.8Unverified
5AudioMAE (local)Top-1 (%)94.8Unverified
6ATST Base (ours)Top-1 (%)94.3Unverified
7AudioMAE (global)Top-1 (%)94.1Unverified
8AutoSpeech (N=8,C=128)Top-1 (%)87.66Unverified
9SSAST-FRAMETop-1 (%)80.8Unverified
10SSAMBATop-1 (%)70.1Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)67.77Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)80.83Unverified
#ModelMetricClaimedVerifiedStatus
1Fuzzy RetrievalTop-1 (%)95.13Unverified