VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization
2021-04-20Code Available1· sign in to hype
Pankaj Mishra, Riccardo Verk, Daniele Fornasier, Claudio Piciarelli, Gian Luca Foresti
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/pankajmishra000/VT-ADLOfficialpytorch★ 120
Abstract
We present a transformer-based image anomaly detection and localization network. Our proposed model is a combination of a reconstruction-based approach and patch embedding. The use of transformer networks helps to preserve the spatial information of the embedded patches, which are later processed by a Gaussian mixture density network to localize the anomalous areas. In addition, we also publish BTAD, a real-world industrial anomaly dataset. Our results are compared with other state-of-the-art algorithms using publicly available datasets like MNIST and MVTec.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| BTAD | VT-ADL | Segmentation AUROC | 81.8 | — | Unverified |