Data Noising as Smoothing in Neural Network Language Models

2017-03-07Code Available0· sign in to hype

Ziang Xie, Sida I. Wang, Jiwei Li, Daniel Lévy, Aiming Nie, Dan Jurafsky, Andrew Y. Ng

Code Available — Be the first to reproduce this paper.

Code

github.com/stanfordmlgroup/nlm-noising
tf★ 0

Abstract

Data noising is an effective technique for regularizing neural network models. While noising is widely adopted in application domains such as vision and speech, commonly used noising primitives have not been developed for discrete sequence-level settings such as language modeling. In this paper, we derive a connection between input noising in neural network language models and smoothing in n-gram models. Using this connection, we draw upon ideas from smoothing to develop effective noising schemes. We demonstrate performance gains when applying the proposed schemes to language modeling and machine translation. Finally, we provide empirical analysis validating the relationship between noising and smoothing.

Tasks

Language Modeling Language Modelling Machine Translation Translation

Data Noising as Smoothing in Neural Network Language Models

Code

Abstract

Tasks

Reproductions