Long Short-Term Memory-Networks for Machine Reading
Jianpeng Cheng, Li Dong, Mirella Lapata
Abstract
In this paper we address the question of how to render sequence-level networks better at handling structured input. We propose a machine reading simulator which processes text incrementally from left to right and performs shallow reasoning with memory and attention. The reader extends the Long Short-Term Memory architecture with a memory network in place of a single memory cell. This enables adaptive memory usage during recurrence with neural attention, offering a way to weakly induce relations among tokens. The system is initially designed to process a single sequence, but we also demonstrate how to integrate it with an encoder-decoder architecture. Experiments on language modeling, sentiment analysis, and natural language inference show that our model matches or outperforms the state of the art.
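To make the recurrence described in the abstract concrete, below is a minimal NumPy sketch of a single LSTMN step: intra-attention over tapes of past hidden states and memory cells produces adaptive summary vectors, which then stand in for the single previous hidden state and cell in the usual LSTM gates. The parameter names (`W_h`, `W_x`, `W_s`, `v`, the gate matrices), dimensions, and the omission of bias terms are illustrative assumptions; this is a sketch of the idea, not the authors' released implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstmn_step(x_t, H, C, h_tilde_prev, p):
    """One LSTMN recurrence step (illustrative, biases omitted).

    x_t          : (d,) embedding of the current token
    H, C         : (t-1, d) tapes of all previous hidden states / memory cells
    h_tilde_prev : (d,) previous adaptive hidden summary
    p            : dict of parameters (hypothetical names)
    """
    d = x_t.shape[0]
    if len(H) == 0:
        # First token: nothing to attend to, so summaries default to zero.
        h_tilde, c_tilde = np.zeros(d), np.zeros(d)
    else:
        # Intra-attention over the memory tape: each past position i gets a
        # score a_i = v . tanh(W_h h_i + W_x x_t + W_s h~_{t-1}).
        a = np.tanh(H @ p["W_h"].T + p["W_x"] @ x_t + p["W_s"] @ h_tilde_prev) @ p["v"]
        s = np.exp(a - a.max()); s /= s.sum()    # softmax over positions 1..t-1
        h_tilde = s @ H                          # adaptive hidden summary
        c_tilde = s @ C                          # adaptive memory summary

    # Standard LSTM gating, with the attention summaries replacing the
    # single previous hidden state and memory cell.
    z = np.concatenate([h_tilde, x_t])           # (2d,)
    i, f, o = sigmoid(p["W_i"] @ z), sigmoid(p["W_f"] @ z), sigmoid(p["W_o"] @ z)
    c_t = f * c_tilde + i * np.tanh(p["W_c"] @ z)
    h_t = o * np.tanh(c_t)

    # Grow the tapes so later tokens can attend back to this position.
    return h_t, np.vstack([H, h_t[None]]), np.vstack([C, c_t[None]]), h_tilde

# Toy usage: run the reader over a short "sentence" of random embeddings.
rng = np.random.default_rng(0)
d = 8
p = {k: rng.normal(scale=0.1, size=(d, d)) for k in ("W_h", "W_x", "W_s")}
p |= {k: rng.normal(scale=0.1, size=(d, 2 * d)) for k in ("W_i", "W_f", "W_o", "W_c")}
p["v"] = rng.normal(scale=0.1, size=d)
H, C, h_tilde = np.zeros((0, d)), np.zeros((0, d)), np.zeros(d)
for x_t in rng.normal(size=(5, d)):
    h_t, H, C, h_tilde = lstmn_step(x_t, H, C, h_tilde, p)
print(h_t.shape, H.shape)  # (8,) (5, 8)
```

Note the key design choice this sketch highlights: the tapes grow with the sequence, so memory capacity is adaptive rather than fixed, and the attention weights weakly induce token-to-token relations as a side effect of reading.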
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| SNLI | 450D LSTMN with deep attention fusion | Test Accuracy (%) | 86.3 | — | Unverified |
| SNLI | 300D LSTMN with deep attention fusion | Test Accuracy (%) | 85.7 | — | Unverified |