Dynamic Evaluation of Neural Sequence Models
2017-09-21 · ICML 2018
Ben Krause, Emmanuel Kahembwe, Iain Murray, Steve Renals
Code
- github.com/benkrause/dynamic-evaluation (official, in paper, PyTorch)
- github.com/sacmehta/PRU (PyTorch)
- github.com/benkrause/dynamiceval-transformer (TensorFlow)
Abstract
We present methodology for using dynamic evaluation to improve neural sequence models. Models are adapted to recent history via a gradient descent based mechanism, causing them to assign higher probabilities to re-occurring sequential patterns. Dynamic evaluation outperforms existing adaptation approaches in our comparisons. Dynamic evaluation improves the state-of-the-art word-level perplexities on the Penn Treebank and WikiText-2 datasets to 51.1 and 44.3 respectively, and the state-of-the-art character-level cross-entropies on the text8 and Hutter Prize datasets to 1.19 bits/char and 1.08 bits/char respectively.
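To illustrate the core idea behind dynamic evaluation — taking gradient-descent steps on recently observed data during evaluation so the model assigns higher probability to re-occurring patterns — here is a minimal, self-contained sketch. It uses a toy softmax-over-logits predictor rather than the paper's LSTM/mLSTM models, and the learning rate and vocabulary size are illustrative choices, not values from the paper:

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of logits.
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    z = sum(exps)
    return [e / z for e in exps]

def evaluate(seq, vocab_size, lr=0.0):
    """Average negative log-likelihood (nats/symbol) of a toy model on seq.

    With lr == 0 this is ordinary static evaluation. With lr > 0 the model
    takes one SGD step on each observed symbol after predicting it,
    mimicking dynamic evaluation's adaptation to recent history.
    """
    logits = [0.0] * vocab_size  # start from a uniform predictive distribution
    nll = 0.0
    for sym in seq:
        probs = softmax(logits)
        nll += -math.log(probs[sym])
        if lr > 0:
            # Gradient of -log softmax(logits)[sym] w.r.t. logits is
            # probs - onehot(sym); apply one SGD step on the observed symbol.
            for i in range(vocab_size):
                grad = probs[i] - (1.0 if i == sym else 0.0)
                logits[i] -= lr * grad
    return nll / len(seq)

# A highly repetitive stream over a 10-symbol vocabulary: dynamic evaluation
# should adapt to the repetition and beat the static model.
seq = [0, 1] * 50
static_nll = evaluate(seq, vocab_size=10, lr=0.0)
dynamic_nll = evaluate(seq, vocab_size=10, lr=0.5)
print(f"static:  {static_nll:.3f} nats/symbol")
print(f"dynamic: {dynamic_nll:.3f} nats/symbol")
```

On this repetitive sequence the static model stays at the uniform baseline (ln 10 ≈ 2.30 nats/symbol), while the dynamically adapted model concentrates probability on the recurring symbols and achieves a lower loss — the same effect that drives the perplexity gains reported below.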
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Hutter Prize | mLSTM + dynamic eval | Bits per Character (BPC) | 1.08 | — | Unverified |
| Penn Treebank (Word Level) | AWD-LSTM + dynamic eval | Test perplexity | 51.1 | — | Unverified |
| Text8 | mLSTM + dynamic eval | Bits per Character (BPC) | 1.19 | — | Unverified |
| WikiText-2 | AWD-LSTM + dynamic eval | Test perplexity | 44.3 | — | Unverified |