
Empirical Evaluation of Rectified Activations in Convolutional Network

2015-05-05 · Code Available

Bing Xu, Naiyan Wang, Tianqi Chen, Mu Li


Abstract

In this paper we investigate the performance of different types of rectified activation functions in convolutional neural networks: the standard rectified linear unit (ReLU), the leaky rectified linear unit (Leaky ReLU), the parametric rectified linear unit (PReLU), and a new randomized leaky rectified linear unit (RReLU). We evaluate these activation functions on standard image classification tasks. Our experiments suggest that incorporating a non-zero slope for the negative part of rectified activation units consistently improves the results. Our findings thus contradict the common belief that sparsity is the key to ReLU's good performance. Moreover, on small-scale datasets, both using a deterministic negative slope and learning it are prone to overfitting; they are not as effective as their randomized counterpart. Using RReLU, we achieved 75.68\% accuracy on the CIFAR-100 test set without multiple tests or ensembles.
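The four activations compared in the abstract differ only in how they treat negative inputs. As a minimal NumPy sketch (not the authors' implementation; the bounds `lower=3, upper=8` for RReLU's uniform distribution are the defaults reported in the paper, and the slope on the negative side is `1/a` with `a ~ U(lower, upper)`):

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(x):
    # Standard ReLU: zero out negative inputs.
    return np.maximum(x, 0.0)

def leaky_relu(x, slope=0.01):
    # Leaky ReLU: fixed small slope on the negative part.
    return np.where(x >= 0.0, x, slope * x)

# PReLU uses the same form as leaky_relu, but `slope` is a learned
# parameter (updated by backpropagation) rather than a fixed constant.

def rrelu(x, lower=3.0, upper=8.0, training=True):
    # RReLU: during training, each negative input is scaled by 1/a with
    # a sampled from U(lower, upper); at test time the slope is replaced
    # by its expectation, 2 / (lower + upper).
    if training:
        a = rng.uniform(lower, upper, size=x.shape)
        return np.where(x >= 0.0, x, x / a)
    return np.where(x >= 0.0, x, x * (2.0 / (lower + upper)))
```

The randomness in RReLU acts as a regularizer, which is consistent with the abstract's observation that it overfits less than deterministic or learned slopes on small datasets.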

Benchmark Results

Dataset    | Model | Metric             | Claimed | Verified | Status
-----------|-------|--------------------|---------|----------|-----------
CIFAR-10   | RReLU | Percentage correct | 88.8    |          | Unverified
CIFAR-100  | RReLU | Percentage correct | 59.8    |          | Unverified
