SOTAVerified

WaveMix: A Resource-efficient Neural Network for Image Analysis

2022-05-28Code Available1· sign in to hype

Pranav Jeevan, Kavitha Viswanathan, Anandu A S, Amit Sethi

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We propose a novel neural architecture for computer vision -- WaveMix -- that is resource-efficient and yet generalizable and scalable. While using fewer trainable parameters, GPU RAM, and computations, WaveMix networks achieve comparable or better accuracy than the state-of-the-art convolutional neural networks, vision transformers, and token mixers for several tasks. This efficiency can translate to savings in time, cost, and energy. To achieve these gains we used multi-level two-dimensional discrete wavelet transform (2D-DWT) in WaveMix blocks, which has the following advantages: (1) It reorganizes spatial information based on three strong image priors -- scale-invariance, shift-invariance, and sparseness of edges -- (2) in a lossless manner without adding parameters, (3) while also reducing the spatial sizes of feature maps, which reduces the memory and time required for forward and backward passes, and (4) expanding the receptive field faster than convolutions do. The whole architecture is a stack of self-similar and resolution-preserving WaveMix blocks, which allows architectural flexibility for various tasks and levels of resource availability. WaveMix establishes new benchmarks for segmentation on Cityscapes; and for classification on Galaxy 10 DECals, Places-365, five EMNIST datasets, and iNAT-mini and performs competitively on other benchmarks. Our code and trained models are publicly available.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
Caltech-256WaveMixLite-256/7Accuracy54.62Unverified
CIFAR-10WaveMixLite-144/7Percentage correct97.29Unverified
CIFAR-100WaveMix-Lite-256/7Percentage correct70.2Unverified
CIFAR-100WaveMixLite-256/7Percentage correct85.09Unverified
EMNIST-BalancedWaveMixLite-128/7Accuracy91.06Unverified
EMNIST-ByclassWaveMixLite-128/7Accuracy88.43Unverified
EMNIST-BymergeWaveMixLite-128/16Accuracy91.8Unverified
EMNIST-DigitsWaveMixLite-112/16Accuracy (%)99.82Unverified
EMNIST-LettersWaveMixLite-112/16Accuracy95.96Unverified
Fashion-MNISTWaveMixLitePercentage error5.68Unverified
Galaxy10 DECalsWaveMixTop-1 Accuracy (%)95.42Unverified
ImageNetWaveMix-192/16 (level 3)Top 1 Accuracy74.93Unverified
iNat2021-miniWaveMix-256/16 (level 2)Top 1 Accuracy61.75Unverified
MNISTWaveMixLitePercentage error0.25Unverified
Places365-StandardWaveMix-240/12 (level 4)Top 1 Accuracy56.45Unverified
STL-10WaveMixLite-256/7Percentage correct70.88Unverified
SVHNWaveMixLite-144/15Percentage error1.27Unverified
Tiny ImageNet ClassificationWaveMixLite-144/7Validation Acc77.47Unverified

Reproductions