VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization

2021-04-20Code Available1· sign in to hype

Pankaj Mishra, Riccardo Verk, Daniele Fornasier, Claudio Piciarelli, Gian Luca Foresti

Code Available — Be the first to reproduce this paper.

Code

github.com/pankajmishra000/VT-ADL
Officialpytorch★ 120

Abstract

We present a transformer-based image anomaly detection and localization network. Our proposed model is a combination of a reconstruction-based approach and patch embedding. The use of transformer networks helps to preserve the spatial information of the embedded patches, which are later processed by a Gaussian mixture density network to localize the anomalous areas. In addition, we also publish BTAD, a real-world industrial anomaly dataset. Our results are compared with other state-of-the-art algorithms using publicly available datasets like MNIST and MVTec.

Tasks

Anomaly Detection

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
BTAD	VT-ADL	Segmentation AUROC	81.8	—	Unverified

VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization

Code

Abstract

Tasks

Benchmark Results

Reproductions