Soft Threshold Weight Reparameterization for Learnable Sparsity

2020-02-08ICML 2020Code Available1· sign in to hype

Aditya Kusupati, Vivek Ramanujan, Raghav Somani, Mitchell Wortsman, Prateek Jain, Sham Kakade, Ali Farhadi

Code Available — Be the first to reproduce this paper.

Code

github.com/RAIVNLab/STR
OfficialIn paperpytorch★ 91

Abstract

Sparsity in Deep Neural Networks (DNNs) is studied extensively with the focus of maximizing prediction accuracy given an overall parameter budget. Existing methods rely on uniform or heuristic non-uniform sparsity budgets which have sub-optimal layer-wise parameter allocation resulting in a) lower prediction accuracy or b) higher inference cost (FLOPs). This work proposes Soft Threshold Reparameterization (STR), a novel use of the soft-threshold operator on DNN weights. STR smoothly induces sparsity while learning pruning thresholds thereby obtaining a non-uniform sparsity budget. Our method achieves state-of-the-art accuracy for unstructured sparsity in CNNs (ResNet50 and MobileNetV1 on ImageNet-1K), and, additionally, learns non-uniform budgets that empirically reduce the FLOPs by up to 50%. Notably, STR boosts the accuracy over existing results by up to 10% in the ultra sparse (99%) regime and can also be used to induce low-rank (structured sparsity) in RNNs. In short, STR is a simple mechanism which learns effective sparsity budgets that contrast with popular heuristics. Code, pretrained models and sparsity budgets are at https://github.com/RAIVNLab/STR.

Tasks

Network Pruning

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
ImageNet - ResNet 50 - 90% sparsity	STR	Top-1 Accuracy	74.31	—	Unverified
ImageNet - ResNet 50 - 90% sparsity	GMP	Top-1 Accuracy	73.91	—	Unverified

Soft Threshold Weight Reparameterization for Learnable Sparsity

Code

Abstract

Tasks

Benchmark Results

Reproductions