Dynamic Convolution: Attention over Convolution Kernels

2019-12-07CVPR 2020Code Available0· sign in to hype

Yinpeng Chen, Xiyang Dai, Mengchen Liu, Dong-Dong Chen, Lu Yuan, Zicheng Liu

Code Available — Be the first to reproduce this paper.

Code

github.com/mindspore-courses/External-Attention-MindSpore/blob/main/model/conv/DynamicConv.py
mindspore★ 0
github.com/prstrive/CondConv-tensorflow
tf★ 0
github.com/kaijieshi7/Dynamic-convolution-Pytorch
pytorch★ 0
github.com/TArdelean/DynamicConvolution
pytorch★ 0

Abstract

Light-weight convolutional neural networks (CNNs) suffer performance degradation as their low computational budgets constrain both the depth (number of convolution layers) and the width (number of channels) of CNNs, resulting in limited representation capability. To address this issue, we present Dynamic Convolution, a new design that increases model complexity without increasing the network depth or width. Instead of using a single convolution kernel per layer, dynamic convolution aggregates multiple parallel convolution kernels dynamically based upon their attentions, which are input dependent. Assembling multiple kernels is not only computationally efficient due to the small kernel size, but also has more representation power since these kernels are aggregated in a non-linear way via attention. By simply using dynamic convolution for the state-of-the-art architecture MobileNetV3-Small, the top-1 accuracy of ImageNet classification is boosted by 2.9% with only 4% additional FLOPs and 2.9 AP gain is achieved on COCO keypoint detection.

Tasks

Image Classification Keypoint Detection

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
ImageNet	DY-MobileNetV2 ×1.0	Top 1 Accuracy	74.4	—	Unverified
ImageNet	DY-MobileNetV2 ×0.75	Top 1 Accuracy	72.8	—	Unverified
ImageNet	DY-ResNet-18	Top 1 Accuracy	72.7	—	Unverified
ImageNet	DY-MobileNetV3-Small	Top 1 Accuracy	69.7	—	Unverified
ImageNet	DY-MobileNetV2 ×0.5	Top 1 Accuracy	69.4	—	Unverified
ImageNet	DY-ResNet-10	Top 1 Accuracy	67.7	—	Unverified
ImageNet	DY-MobileNetV2 ×0.35	Top 1 Accuracy	64.9	—	Unverified

Dynamic Convolution: Attention over Convolution Kernels

Code

Abstract

Tasks

Benchmark Results

Reproductions