ILP-M Conv: Optimize Convolution Algorithm for Single-Image Convolution Neural Network Inference on Mobile GPUs
2019-09-06Code Available0· sign in to hype
Zhuoran Ji
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/jizhuoran/sj_convolutionOfficialIn papernone★ 0
Abstract
Convolution neural networks are widely used for mobile applications. However, GPU convolution algorithms are designed for mini-batch neural network training, the single-image convolution neural network inference algorithm on mobile GPUs is not well-studied. After discussing the usage difference and examining the existing convolution algorithms, we proposed the HNTMP convolution algorithm. The HNTMP convolution algorithm achieves 14.6 speedup than the most popular im2col convolution algorithm, and 2.30 speedup than the fastest existing convolution algorithm (direct convolution) as far as we know.