Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
Kai Sheng Tai, Richard Socher, Christopher D. Manning
Code
- github.com/stanfordnlp/treelstm (Torch; official implementation from the paper)
- github.com/rohitguptacs/ReVal (Torch)
- github.com/tensorflow/fold (TensorFlow)
- github.com/Mind23-2/MindCode-17 (MindSpore)
- github.com/jayanti-prasad/TreeLSTM (PyTorch)
- github.com/zxk19981227/LSTM-SST (PyTorch)
- github.com/munashe5/SemanticTreeLSTM (TensorFlow)
- github.com/dmlc/dgl/tree/master/examples/pytorch/tree_lstm (PyTorch)
- github.com/EmilReinert/DeepLearningPipelines (PyTorch)
- github.com/vastsak/tree_structured_gru (TensorFlow)
Abstract
Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of sequence modeling tasks. The only underlying LSTM structure that has been explored so far is a linear chain. However, natural language exhibits syntactic properties that would naturally combine words into phrases. We introduce the Tree-LSTM, a generalization of LSTMs to tree-structured network topologies. Tree-LSTMs outperform all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences (SemEval 2014, Task 1) and sentiment classification (Stanford Sentiment Treebank).
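The Tree-LSTM generalization described in the abstract can be sketched in a few lines. The following is a minimal NumPy illustration of the paper's Child-Sum Tree-LSTM composition equations (each node sums its children's hidden states but keeps a separate forget gate per child); it is not the authors' implementation — see the linked stanfordnlp/treelstm repository for that — and the class and parameter names here are chosen for illustration only.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class ChildSumTreeLSTMCell:
    """Sketch of a Child-Sum Tree-LSTM cell (Tai et al., 2015).

    Unlike a chain LSTM, a node may have any number of children.
    The input, output, and update gates see the *sum* of the children's
    hidden states, while each child gets its own forget gate, letting
    the cell selectively discard information from individual subtrees.
    """

    def __init__(self, in_dim, mem_dim, seed=0):
        rng = np.random.default_rng(seed)
        s = 1.0 / np.sqrt(mem_dim)
        # One input->gate and one hidden->gate matrix per gate (i, f, o, u).
        self.W = {g: rng.uniform(-s, s, (mem_dim, in_dim)) for g in "ifou"}
        self.U = {g: rng.uniform(-s, s, (mem_dim, mem_dim)) for g in "ifou"}
        self.b = {g: np.zeros(mem_dim) for g in "ifou"}

    def forward(self, x, child_h, child_c):
        """x: (in_dim,) input; child_h, child_c: lists of child states
        (empty lists at leaf nodes). Returns (h, c) for this node."""
        h_tilde = (np.sum(child_h, axis=0) if child_h
                   else np.zeros_like(self.b["i"]))
        i = sigmoid(self.W["i"] @ x + self.U["i"] @ h_tilde + self.b["i"])
        o = sigmoid(self.W["o"] @ x + self.U["o"] @ h_tilde + self.b["o"])
        u = np.tanh(self.W["u"] @ x + self.U["u"] @ h_tilde + self.b["u"])
        # One forget gate per child, conditioned on that child's own state.
        f = [sigmoid(self.W["f"] @ x + self.U["f"] @ hk + self.b["f"])
             for hk in child_h]
        c = i * u + sum(fk * ck for fk, ck in zip(f, child_c))
        h = o * np.tanh(c)
        return h, c
```

A sentence is then encoded by running this cell bottom-up over its parse tree: compute (h, c) at the leaves first, then at each internal node using its children's states. With a constituency parse and tied forget-gate parameters per child position, the same structure yields the paper's N-ary (Constituency) Tree-LSTM variant.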
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| SICK | Dependency Tree-LSTM (Tai et al., 2015) | MSE | 0.25 | — | Unverified |
| SICK | Bidirectional LSTM (Tai et al., 2015) | MSE | 0.27 | — | Unverified |
| SICK | LSTM (Tai et al., 2015) | MSE | 0.28 | — | Unverified |