GhostNetV2: Enhance Cheap Operation with Long-Range Attention

2022-11-23Code Available2· sign in to hype

Yehui Tang, Kai Han, Jianyuan Guo, Chang Xu, Chao Xu, Yunhe Wang

Code Available — Be the first to reproduce this paper.

Code

github.com/leondgarse/keras_cv_attention_models
tf★ 623
github.com/likyoo/GhostNetV2-PyTorch
pytorch★ 12
github.com/MindSpore-paper-code-3/code4/tree/main/ghostnetv2
mindspore★ 0
github.com/MindSpore-paper-code-3/code10/tree/main/ghostnetv2
mindspore★ 0
github.com/MindSpore-paper-code-3/code10/tree/main/ghostnet_quant
mindspore★ 0
github.com/2023-MindSpore-4/Code10/tree/main/ghostnetv2
mindspore★ 0
github.com/2023-MindSpore-1/ms-code-214/tree/main/Inception-v2
mindspore★ 0
github.com/MindSpore-paper-code-3/code7/tree/main/ghostnet
mindspore★ 0
github.com/MindSpore-paper-code-3/code4/tree/main/ghostnet_quant
mindspore★ 0

Abstract

Light-weight convolutional neural networks (CNNs) are specially designed for applications on mobile devices with faster inference speed. The convolutional operation can only capture local information in a window region, which prevents performance from being further improved. Introducing self-attention into convolution can capture global information well, but it will largely encumber the actual speed. In this paper, we propose a hardware-friendly attention mechanism (dubbed DFC attention) and then present a new GhostNetV2 architecture for mobile applications. The proposed DFC attention is constructed based on fully-connected layers, which can not only execute fast on common hardware but also capture the dependence between long-range pixels. We further revisit the expressiveness bottleneck in previous GhostNet and propose to enhance expanded features produced by cheap operations with DFC attention, so that a GhostNetV2 block can aggregate local and long-range information simultaneously. Extensive experiments demonstrate the superiority of GhostNetV2 over existing architectures. For example, it achieves 75.3% top-1 accuracy on ImageNet with 167M FLOPs, significantly suppressing GhostNetV1 (74.5%) with a similar computational cost. The source code will be available at https://github.com/huawei-noah/Efficient-AI-Backbones/tree/master/ghostnetv2_pytorch and https://gitee.com/mindspore/models/tree/master/research/cv/ghostnetv2.

GhostNetV2: Enhance Cheap Operation with Long-Range Attention

Code

Abstract

Reproductions