SOTAVerified

Recurrent Batch Normalization

2016-03-30 · Code Available

Tim Cooijmans, Nicolas Ballas, César Laurent, Çağlar Gülçehre, Aaron Courville


Abstract

We propose a reparameterization of LSTM that brings the benefits of batch normalization to recurrent neural networks. Whereas previous works only apply batch normalization to the input-to-hidden transformation of RNNs, we demonstrate that it is both possible and beneficial to batch-normalize the hidden-to-hidden transition, thereby reducing internal covariate shift between time steps. We evaluate our proposal on various sequential problems such as sequence classification, language modeling and question answering. Our empirical results show that our batch-normalized LSTM consistently leads to faster convergence and improved generalization.
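To make the idea concrete, the following is a minimal NumPy sketch of one step of the batch-normalized LSTM described in the abstract: batch normalization is applied separately to the input-to-hidden and hidden-to-hidden transformations before they are summed, and again to the cell state before the output nonlinearity. Function names, shapes, and the use of per-timestep batch statistics are illustrative assumptions, not the authors' reference implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def batch_norm(x, gamma, eps=1e-5):
    # Normalize each feature over the batch dimension. The shift (beta)
    # is folded into the LSTM bias term, so only a scale gamma is used.
    mean = x.mean(axis=0, keepdims=True)
    var = x.var(axis=0, keepdims=True)
    return gamma * (x - mean) / np.sqrt(var + eps)

def bn_lstm_step(x_t, h_prev, c_prev, Wx, Wh, b,
                 gamma_x, gamma_h, gamma_c):
    """One training-time step of a batch-normalized LSTM (sketch).

    x_t: (batch, input_dim), h_prev/c_prev: (batch, hidden_dim).
    Wx: (input_dim, 4*hidden_dim), Wh: (hidden_dim, 4*hidden_dim).
    """
    # BN is applied to the hidden-to-hidden transition as well as the
    # input-to-hidden one, each with its own gamma, before summing.
    gates = batch_norm(x_t @ Wx, gamma_x) + batch_norm(h_prev @ Wh, gamma_h) + b
    i, f, o, g = np.split(gates, 4, axis=1)
    c_t = sigmoid(f) * c_prev + sigmoid(i) * np.tanh(g)
    # The cell state is also batch-normalized before the output gate.
    h_t = sigmoid(o) * np.tanh(batch_norm(c_t, gamma_c))
    return h_t, c_t

# Hypothetical usage with small random weights; the paper recommends
# initializing the BN scale gamma to a small value such as 0.1.
rng = np.random.default_rng(0)
B, D, H = 4, 3, 5
x = rng.normal(size=(B, D))
h0 = np.zeros((B, H))
c0 = np.zeros((B, H))
Wx = rng.normal(size=(D, 4 * H))
Wh = rng.normal(size=(H, 4 * H))
b = np.zeros(4 * H)
h1, c1 = bn_lstm_step(x, h0, c0, Wx, Wh, b,
                      0.1 * np.ones(4 * H), 0.1 * np.ones(4 * H),
                      0.1 * np.ones(H))
```

This sketch uses per-timestep batch statistics, as in training; at test time the paper's approach substitutes running averages accumulated per time step.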

Tasks

Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
|---------|-------|--------|---------|----------|--------|
| Text8 | BN LSTM | Bits per Character (BPC) | 1.36 | | Unverified |

Reproductions