Be Careful When Evaluating Explanations Regarding Ground Truth
Hubert Baniecki, Maciej Chrabaszcz, Andreas Holzinger, Bastian Pfeifer, Anna Saranti, Przemyslaw Biecek
- Code (official, PyTorch): github.com/mi2datalab/be-careful-evaluating-explanations
Abstract
Evaluating explanations of image classifiers regarding ground truth, e.g. segmentation masks defined by human perception, primarily evaluates the quality of the models under consideration rather than the explanation methods themselves. Driven by this observation, we propose a framework for jointly evaluating the robustness of safety-critical systems that combine a deep neural network with an explanation method. These are increasingly used in real-world applications like medical image analysis or robotics. We introduce a fine-tuning procedure to (mis)align model–explanation pipelines with ground truth and use it to quantify the potential discrepancy between worst- and best-case scenarios of human alignment. Experiments across various model architectures and post-hoc local interpretation methods provide insights into the robustness of vision transformers and the overall vulnerability of such AI systems to potential adversarial attacks.
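
To make the two ideas in the abstract concrete, below is a minimal PyTorch sketch, not the paper's implementation: it scores how much of a saliency map's mass falls inside a ground-truth segmentation mask (a common "relevance mass accuracy"-style metric, assumed here as the alignment measure) and uses that score in a hypothetical fine-tuning loss that either rewards or penalizes alignment. The names `mass_accuracy`, `saliency_fn`, `alignment_loss`, and the `misalign` flag are illustrative assumptions, and plain input-gradient saliency stands in for whichever post-hoc explanation method is used.

```python
# Illustrative sketch only (assumptions noted above), not the paper's exact procedure.
import torch
import torch.nn.functional as F


def mass_accuracy(saliency: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    """Fraction of non-negative attribution mass that falls inside the mask.

    saliency: (H, W) attribution map; mask: (H, W) binary ground-truth mask.
    """
    saliency = saliency.clamp(min=0)
    return (saliency * mask).sum() / (saliency.sum() + 1e-12)


def saliency_fn(model: torch.nn.Module, x: torch.Tensor, target: int) -> torch.Tensor:
    """Plain input-gradient saliency for a single image x of shape (1, C, H, W)."""
    x = x.clone().requires_grad_(True)
    score = model(x)[0, target]
    grad, = torch.autograd.grad(score, x, create_graph=True)
    return grad.abs().sum(dim=1)[0]  # aggregate channels -> (H, W)


def alignment_loss(model, x, target, mask, misalign: bool = False) -> torch.Tensor:
    """Cross-entropy plus a term that rewards (or, if misalign=True, penalizes)
    attribution mass inside the ground-truth mask -- a toy version of the
    (mis)alignment fine-tuning idea described in the abstract."""
    logits = model(x)
    ce = F.cross_entropy(logits, torch.tensor([target], device=x.device))
    align = mass_accuracy(saliency_fn(model, x, target), mask)
    sign = 1.0 if misalign else -1.0  # minimizing the loss then lowers or raises alignment
    return ce + sign * align
```

In this reading, fine-tuning with `misalign=False` approximates the best-case scenario of human alignment, while `misalign=True` approximates the worst case; the gap between the two alignment scores quantifies the discrepancy the abstract refers to.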