Uncertainty Estimation of Transformer Predictions for Misclassification Detection

2022-05-01ACL 2022Code Available0· sign in to hype

Artem Vazhentsev, Gleb Kuzmin, Artem Shelmanov, Akim Tsvigun, Evgenii Tsymbalov, Kirill Fedyanin, Maxim Panov, Alexander Panchenko, Gleb Gusev, Mikhail Burtsev, Manvel Avetisian, Leonid Zhukov

arXiv PDF

Code Available — Be the first to reproduce this paper.

Reproduce

Code

github.com/airi-institute/uncertainty_transformers
OfficialIn paperpytorch★ 4

Abstract

Uncertainty estimation (UE) of model predictions is a crucial step for a variety of tasks such as active learning, misclassification detection, adversarial attack detection, out-of-distribution detection, etc. Most of the works on modeling the uncertainty of deep neural networks evaluate these methods on image classification tasks. Little attention has been paid to UE in natural language processing. To fill this gap, we perform a vast empirical investigation of state-of-the-art UE methods for Transformer models on misclassification detection in named entity recognition and text classification tasks and propose two computationally efficient modifications, one of which approaches or even outperforms computationally intensive methods.

Tasks

Active Learning Adversarial Attack Adversarial Attack Detection Classification image-classification Image Classification named-entity-recognition Named Entity Recognition Named Entity Recognition (NER)Out-of-Distribution Detection text-classification Text Classification

Uncertainty Estimation of Transformer Predictions for Misclassification Detection

Code

Abstract

Tasks

Reproductions