Caffe con Troll: Shallow Ideas to Speed Up Deep Learning

2015-04-16

Stefan Hadjis, Firas Abuzaid, Ce Zhang, Christopher Ré

Code

github.com/HazyResearch/CaffeConTroll

Abstract

We present Caffe con Troll (CcT), a fully compatible end-to-end version of the popular framework Caffe with rebuilt internals. We built CcT to examine the performance characteristics of training and deploying general-purpose convolutional neural networks across different hardware architectures. We find that, by employing standard batching optimizations for CPU training, we achieve a 4.5x throughput improvement over Caffe on popular networks like CaffeNet. Moreover, with these improvements, the end-to-end training time for CNNs is directly proportional to the FLOPS delivered by the CPU, which enables us to efficiently train hybrid CPU-GPU systems for CNNs.
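The abstract's last claim suggests a simple scheduling idea: if training time scales with delivered FLOPS, a batch can be divided across devices in proportion to each device's throughput. The following is a minimal sketch of that idea only, not CcT's actual scheduler; the function name `split_batch`, the device labels, and the FLOPS figures are hypothetical and chosen for illustration.

```python
def split_batch(batch_size, device_flops):
    """Split batch_size items across devices in proportion to their FLOPS.

    device_flops maps a (hypothetical) device label to its delivered
    throughput in any consistent unit. Returns a dict of item counts
    that always sums to batch_size.
    """
    total = sum(device_flops.values())
    # Exact proportional share for each device (may be fractional).
    exact = {name: batch_size * f / total for name, f in device_flops.items()}
    # Start from the floor of each share.
    shares = {name: int(x) for name, x in exact.items()}
    # Give leftover items to the devices with the largest fractional parts.
    remainder = batch_size - sum(shares.values())
    by_fraction = sorted(exact, key=lambda n: exact[n] - shares[n], reverse=True)
    for name in by_fraction[:remainder]:
        shares[name] += 1
    return shares


if __name__ == "__main__":
    # Illustrative numbers only: a GPU assumed to deliver 3x the CPU's FLOPS.
    print(split_batch(256, {"cpu": 1.0, "gpu": 3.0}))
```

Under the assumed 1:3 throughput ratio, the CPU would receive a quarter of the batch and the GPU the rest, so both devices would be expected to finish their portions at roughly the same time.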

Tasks

CPU, Deep Learning, GPU
