A Rate-Distortion Framework for Explaining Black-box Model Decisions

2021-10-12

Stefan Kolek, Duc Anh Nguyen, Ron Levie, Joan Bruna, Gitta Kutyniok

Abstract

We present the Rate-Distortion Explanation (RDE) framework, a mathematically well-founded method for explaining black-box model decisions. The framework is based on perturbations of the target input signal and applies to any differentiable pre-trained model, such as a neural network. Our experiments demonstrate the framework's adaptability to diverse data modalities, particularly images, audio, and physical simulations of urban environments.
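
To make the idea of a perturbation-based explanation for a differentiable model concrete, here is a minimal sketch in plain Python. It is not the authors' implementation, and the abstract does not spell out the objective. The sketch assumes one common formulation: learn a mask `s` in [0, 1]^d that keeps the model output close to its original value when the unmasked components are replaced by Gaussian noise, while an l1 penalty keeps the mask sparse. The toy logistic model, the function names `model` and `rde_mask`, and all hyperparameters (`lam`, `sigma`, `steps`, `lr`, `samples`) are hypothetical choices made for illustration.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def model(w, x):
    """Toy differentiable stand-in for a pre-trained model: sigmoid(w . x)."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)))


def rde_mask(w, x, lam=0.01, sigma=1.0, steps=300, lr=0.5, samples=32, seed=0):
    """Estimate a relevance mask s in [0, 1]^d by projected gradient descent.

    Assumed objective (illustrative, not taken from the paper):
        E_n[(f(x) - f(s*x + (1 - s)*n))^2] + lam * sum(s),  n ~ N(0, sigma^2)
    """
    rng = random.Random(seed)
    d = len(x)
    s = [0.5] * d              # start with every component half-kept
    target = model(w, x)       # output on the unperturbed input
    for _ in range(steps):
        grad = [0.0] * d
        for _ in range(samples):
            # Perturb: keep x where s is near 1, inject noise where s is near 0.
            n = [rng.gauss(0.0, sigma) for _ in range(d)]
            y = [si * xi + (1.0 - si) * ni for si, xi, ni in zip(s, x, n)]
            out = model(w, y)
            # Chain rule for d/ds_i of (out - target)^2 with a sigmoid output.
            common = 2.0 * (out - target) * out * (1.0 - out)
            for i in range(d):
                grad[i] += common * w[i] * (x[i] - n[i]) / samples
        # Gradient step on distortion + l1 penalty, then project onto [0, 1].
        s = [min(1.0, max(0.0, si - lr * (gi + lam))) for si, gi in zip(s, grad)]
    return s


if __name__ == "__main__":
    # The toy model only looks at the first component of the input.
    w = [4.0, 0.0, 0.0]
    x = [1.0, 1.0, 1.0]
    print(rde_mask(w, x))
```

In this toy setup only the first input component influences the model, so the learned mask should keep that component and drive the other two to zero. For a real network the hand-written gradient would be replaced by automatic differentiation, and the noise distribution would be chosen to suit the data modality.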
