Asymmetric Loss For Multi-Label Classification

2020-09-29ICCV 2021Code Available1· sign in to hype

Emanuel Ben-Baruch, Tal Ridnik, Nadav Zamir, Asaf Noy, Itamar Friedman, Matan Protter, Lihi Zelnik-Manor

Code Available — Be the first to reproduce this paper.

Code

github.com/Alibaba-MIIL/ASL
OfficialIn paperpytorch★ 789
github.com/mrT23/TResNet
pytorch★ 478
github.com/Alibaba-MIIL/TResNet
pytorch★ 478
github.com/kalelpark/ral
pytorch★ 18
github.com/SlongLiu/ASL_reproduce
pytorch★ 4

Abstract

In a typical multi-label setting, a picture contains on average few positive labels, and many negative ones. This positive-negative imbalance dominates the optimization process, and can lead to under-emphasizing gradients from positive labels during training, resulting in poor accuracy. In this paper, we introduce a novel asymmetric loss ("ASL"), which operates differently on positive and negative samples. The loss enables to dynamically down-weights and hard-thresholds easy negative samples, while also discarding possibly mislabeled samples. We demonstrate how ASL can balance the probabilities of different samples, and how this balancing is translated to better mAP scores. With ASL, we reach state-of-the-art results on multiple popular multi-label datasets: MS-COCO, Pascal-VOC, NUS-WIDE and Open Images. We also demonstrate ASL applicability for other tasks, such as single-label classification and object detection. ASL is effective, easy to implement, and does not increase the training time or complexity. Implementation is available at: https://github.com/Alibaba-MIIL/ASL.

Tasks

Classification General Classification Image Classification Multi-Label Classification MUlTI-LABEL-ClASSIFICATION object-detection Object Detection

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
MS-COCO	TResNet-L (resolution 448)	mAP	86.6	—	Unverified
MS-COCO	TResNet-XL (resolution 640)	mAP	88.4	—	Unverified
NUS-WIDE	TResNet-L (resolution 448)	MAP	65.2	—	Unverified
OpenImages-v6	TResNet-L	mAP	86.3	—	Unverified
OpenImages-v6	TResNet-L	mAP	87.34	—	Unverified
PASCAL VOC 2007	TResNet-L (resolution 448, pretrain from MS-COCO)	mAP	95.8	—	Unverified
PASCAL VOC 2007	TResNet-L (resolution 448, pretrain from ImageNet)	mAP	94.6	—	Unverified

Asymmetric Loss For Multi-Label Classification

Code

Abstract

Tasks

Benchmark Results

Reproductions