Involution: Inverting the Inherence of Convolution for Visual Recognition

2021-03-10 · CVPR 2021 · Code Available

Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen

Code

Repository	Framework	Stars	Notes
github.com/d-li14/involution	pytorch	1,316	Official, in paper
github.com/PaddlePaddle/PaddleClas	paddle	5,788
github.com/ChristophReich1996/Involution	pytorch	105
github.com/shikishima-TasakiLab/Involution-PyTorch	pytorch	21
github.com/ariG23498/involution-tf	tf	13
github.com/YirunKCL/Tensorflow-Keras-Involution2D	tf	12
github.com/shuuchen/involution.pytorch	pytorch	8
github.com/PrivateMaRyan/keras-involution2Ds	tf	3
github.com/PrivateMaRyan/keras-involution2D	tf	3
github.com/justanhduc/involution	pytorch	2

Abstract

Convolution has been the core ingredient of modern neural networks, triggering the surge of deep learning in vision. In this work, we rethink the inherent principles of standard convolution for vision tasks, specifically spatial-agnostic and channel-specific. Instead, we present a novel atomic operation for deep neural networks by inverting the aforementioned design principles of convolution, coined as involution. We additionally demystify the recent popular self-attention operator and subsume it into our involution family as an over-complicated instantiation. The proposed involution operator could be leveraged as fundamental bricks to build the new generation of neural networks for visual recognition, powering different deep learning models on several prevalent benchmarks, including ImageNet classification, COCO detection and segmentation, together with Cityscapes segmentation. Our involution-based models improve the performance of convolutional baselines using ResNet-50 by up to 1.6% top-1 accuracy, 2.5% and 2.4% bounding box AP, and 4.7% mean IoU absolutely while compressing the computational cost to 66%, 65%, 72%, and 57% on the above benchmarks, respectively. Code and pre-trained models for all the tasks are available at https://github.com/d-li14/involution.
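
To make the "inverted" design principles concrete, here is a minimal pure-Python sketch of an operator that is spatial-specific (each pixel gets its own kernel) and channel-agnostic (that kernel is shared by every channel). It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say how the per-pixel kernel is produced, so the sketch assumes it is generated from the pixel's own feature vector by a single linear map. The names `involution2d`, `weight`, and `k` are hypothetical, and the sketch uses one kernel group, stride 1, and zero padding.

```python
def involution2d(x, weight, k=3):
    """Toy involution over nested lists (illustrative sketch only).

    x:      input feature map shaped [C][H][W]
    weight: assumed kernel generator shaped [k*k][C]; it maps the
            C-dimensional feature at a pixel to a k*k kernel for that pixel
    k:      odd kernel size
    Returns an output feature map shaped [C][H][W].
    """
    C, H, W = len(x), len(x[0]), len(x[0][0])
    pad = k // 2
    out = [[[0.0] * W for _ in range(H)] for _ in range(C)]
    for i in range(H):
        for j in range(W):
            # Spatial-specific: the kernel is computed from the feature
            # vector at (i, j), so it differs from pixel to pixel.
            feat = [x[c][i][j] for c in range(C)]
            kernel = [sum(w * f for w, f in zip(row, feat)) for row in weight]
            for c in range(C):
                # Channel-agnostic: every channel reuses the same kernel.
                acc = 0.0
                for u in range(k):
                    for v in range(k):
                        y, z = i + u - pad, j + v - pad
                        if 0 <= y < H and 0 <= z < W:  # zero padding
                            acc += kernel[u * k + v] * x[c][y][z]
                out[c][i][j] = acc
    return out


# Usage: 2 channels, 2x2 spatial size. The generator below writes channel 0's
# value into the centre tap only, so each output is x[0][i][j] * x[c][i][j].
x = [[[1, 2], [3, 4]],
     [[5, 6], [7, 8]]]
weight = [[0, 0] for _ in range(9)]
weight[4] = [1, 0]  # centre tap of the 3x3 kernel
y = involution2d(x, weight)
```

By contrast, a standard convolution would hold one fixed kernel for all pixels (spatial-agnostic) and a separate one per channel (channel-specific); here the roles are swapped, which is why the kernel must be computed inside the loop over pixels rather than stored as a learned constant.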

Tasks

Image Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
ImageNet	RedNet-152	Top 1 Accuracy	79.3	—	Unverified
ImageNet	RedNet-101	Top 1 Accuracy	79.1	—	Unverified
ImageNet	RedNet-50	Top 1 Accuracy	78.4	—	Unverified
ImageNet	RedNet-38	Top 1 Accuracy	77.6	—	Unverified
ImageNet	RedNet-26	Top 1 Accuracy	75.9	—	Unverified
