SOTAVerified

Lossy Compression for Lossless Prediction

2021-06-21NeurIPS 2021Code Available1· sign in to hype

Yann Dubois, Benjamin Bloem-Reddy, Karen Ullrich, Chris J. Maddison

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Most data is automatically collected and only ever "seen" by algorithms. Yet, data compressors preserve perceptual fidelity rather than just the information needed by algorithms performing downstream tasks. In this paper, we characterize the bit-rate required to ensure high performance on all predictive tasks that are invariant under a set of transformations, such as data augmentations. Based on our theory, we design unsupervised objectives for training neural compressors. Using these objectives, we train a generic image compressor that achieves substantial rate savings (more than 1000 on ImageNet) compared to JPEG on 8 datasets, without decreasing downstream classification performance.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
Caltech101Lossyless CompressorBit rate1,340Unverified
Cars-196Lossyless CompressorBit rate1,470Unverified
CIFAR-10Lossyless CompressorBit rate1,410Unverified
Food-101Lossyless CompressorBit rate1,270Unverified
Oxford-IIIT Pet DatasetLossyless CompressorBit rate1,210Unverified
PCamLossyless CompressorBit rate1,490Unverified
STL-10Lossyless CompressorBit rate1,340Unverified

Reproductions