Learning Activation Functions to Improve Deep Neural Networks

2014-12-21 · Code Available

Forest Agostinelli, Matthew Hoffman, Peter Sadowski, Pierre Baldi


Abstract

Artificial neural networks typically have a fixed, non-linear activation function at each neuron. We have designed a novel form of piecewise linear activation function that is learned independently for each neuron using gradient descent. With this adaptive activation function, we are able to improve upon deep neural network architectures composed of static rectified linear units, achieving state-of-the-art performance on CIFAR-10 (7.51%), CIFAR-100 (30.83%), and a benchmark from high-energy physics involving Higgs boson decay modes.
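The adaptive piecewise linear (APL) unit described in the abstract can be sketched in a few lines. The sketch below assumes the paper's hinge-sum formulation, f(x) = max(0, x) + Σ_s a_s · max(0, −x + b_s), where the slopes a_s and offsets b_s are the per-neuron parameters learned by gradient descent (here fixed by hand for illustration; the parameter names are ours, not the paper's):

```python
import numpy as np

def apl(x, a, b):
    """Adaptive piecewise linear activation (sketch).

    f(x) = max(0, x) + sum_s a[s] * max(0, -x + b[s])

    a, b : per-neuron parameters that would be learned jointly with
    the network weights; fixed here for illustration.
    """
    out = np.maximum(0.0, x)
    for a_s, b_s in zip(a, b):
        out = out + a_s * np.maximum(0.0, -x + b_s)
    return out

x = np.array([-2.0, -0.5, 0.0, 1.0, 3.0])
# With all a_s = 0 the unit reduces to a standard rectified linear unit.
print(apl(x, a=[0.0], b=[0.0]))
# A nonzero a_s adds a learned hinge, letting the unit respond to
# negative inputs in a way a fixed ReLU cannot.
print(apl(x, a=[0.2], b=[1.0]))
```

With S hinges, each neuron adds 2S extra learnable scalars, a small overhead relative to the weight matrices of the layers themselves.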

Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| CIFAR-10 | NiN+APL | Percentage correct | 92.5 | — | Unverified |
| CIFAR-100 | NiN+APL | Percentage correct | 69.2 | — | Unverified |

Reproductions