Null-sampling for Interpretable and Fair Representations

2020-08-12 · ECCV 2020 · Code Available

Thomas Kehrenberg, Myles Bartlett, Oliver Thomas, Novi Quadrianto

Code

github.com/predictive-analytics-lab/nifr
Official · In paper · PyTorch · ★ 9

Abstract

We propose to learn invariant representations, in the data domain, to achieve interpretability in algorithmic fairness. Invariance implies a selectivity for high level, relevant correlations w.r.t. class label annotations, and a robustness to irrelevant correlations with protected characteristics such as race or gender. We introduce a non-trivial setup in which the training set exhibits a strong bias such that class label annotations are irrelevant and spurious correlations cannot be distinguished. To address this problem, we introduce an adversarially trained model with a null-sampling procedure to produce invariant representations in the data domain. To enable disentanglement, a partially-labelled representative set is used. By placing the representations into the data domain, the changes made by the model are easily examinable by human auditors. We show the effectiveness of our method on both image and tabular datasets: Coloured MNIST, CelebA, and the Adult dataset.
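
The null-sampling idea mentioned in the abstract can be illustrated with a toy sketch. It assumes, beyond what the abstract states, that an invertible encoder splits the latent code into a part to keep and a part tied to the protected characteristic, and that the second part is zeroed before decoding back into the data domain. The fixed orthogonal matrix `H` and the helpers `encode`, `decode`, and `null_sample` are hypothetical stand-ins for illustration, not the authors' trained model or the API of the linked repository.

```python
# Toy illustration of null-sampling with an invertible linear "encoder".
# Assumption: H stands in for a learned invertible network; here it is a
# fixed orthogonal matrix (scaled Hadamard), so its inverse is its transpose.
H = [
    [0.5, 0.5, 0.5, 0.5],
    [0.5, -0.5, 0.5, -0.5],
    [0.5, 0.5, -0.5, -0.5],
    [0.5, -0.5, -0.5, 0.5],
]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def transpose(matrix):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*matrix)]


def encode(x):
    """Map a data point to the latent space: z = H x."""
    return matvec(H, x)


def decode(z):
    """Map a latent code back to the data domain: x = H^T z."""
    return matvec(transpose(H), z)


def null_sample(x, n_bias):
    """Zero the last n_bias latent dimensions, then decode.

    Assumption: the last n_bias dimensions are the partition associated
    with the protected characteristic; the rest is left unchanged.
    """
    z = encode(x)
    keep = len(z) - n_bias
    z_null = z[:keep] + [0.0] * n_bias
    return decode(z_null)


x = [1.0, 2.0, 3.0, 5.0]
x_u = null_sample(x, n_bias=1)
```

Because the result `x_u` lives in the same space as the input `x`, the two can be compared directly, which is the property the abstract relies on for human auditing. In this toy, which dimensions carry the protected information is fixed by hand; in the described method that separation would have to be learned.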

Tasks

Disentanglement, Fairness, Image Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CelebA 64x64	cFlow	Accuracy	0.82	—	Unverified
CelebA 64x64	cVAE	Accuracy	0.81	—	Unverified
CelebA 64x64	CNN	Accuracy	0.67	—	Unverified