A Simple Method for Commonsense Reasoning

2018-06-07

Trieu H. Trinh, Quoc V. Le

Code

github.com/gabimelo/portuguese_wsc (PyTorch, 6 stars)
github.com/tensorflow/models/tree/master/research/lm_commonsense (TensorFlow, 0 stars)

Abstract

Commonsense reasoning is a long-standing challenge for deep learning. For example, it is difficult to use neural networks to tackle the Winograd Schema dataset (Levesque et al., 2011). In this paper, we present a simple method for commonsense reasoning with neural networks, using unsupervised learning. Key to our method is the use of language models, trained on a massive amount of unlabeled data, to score multiple-choice questions posed by commonsense reasoning tests. On both Pronoun Disambiguation and Winograd Schema challenges, our models outperform previous state-of-the-art methods by a large margin, without using expensive annotated knowledge bases or hand-engineered features. We train an array of large RNN language models that operate at word or character level on LM-1-Billion, CommonCrawl, SQuAD, Gutenberg Books, and a customized corpus for this task and show that diversity of training data plays an important role in test performance. Further analysis also shows that our system successfully discovers important features of the context that decide the correct answer, indicating a good grasp of commonsense knowledge.
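
The scoring idea in the abstract can be illustrated with a minimal sketch: substitute each candidate referent for the ambiguous pronoun, score the resulting sentence with a language model, and pick the higher-scoring candidate. Everything below is an assumption made for illustration, not the authors' code: the function names (`train_bigram_lm`, `substitute`, `score`, `resolve`) are hypothetical, and a tiny add-one-smoothed bigram model stands in for the large RNN language models the paper trains. "Full" scoring here means the log-probability of the whole substituted sentence, and "partial" scoring means the log-probability of only the words after the substituted candidate, conditioned on everything before them; this reading of the "partial scoring" label in the results table follows the paper's description as best understood.

```python
import math
from collections import Counter


def train_bigram_lm(corpus, alpha=1.0):
    """Toy stand-in for an RNN LM: add-alpha smoothed bigram log-probabilities."""
    prev_counts, bigram_counts, vocab = Counter(), Counter(), set()
    for sentence in corpus:
        tokens = ["<s>"] + sentence.lower().split()
        vocab.update(tokens)
        for prev, word in zip(tokens, tokens[1:]):
            prev_counts[prev] += 1
            bigram_counts[(prev, word)] += 1
    vocab_size = len(vocab) + 1  # one extra slot for unseen words

    def log_prob(prev, word):
        numerator = bigram_counts[(prev, word)] + alpha
        denominator = prev_counts[prev] + alpha * vocab_size
        return math.log(numerator / denominator)

    return log_prob


def substitute(sentence, pronoun_index, candidate):
    """Replace the pronoun token with the candidate; also return where the rest begins."""
    tokens = sentence.lower().split()
    candidate_tokens = candidate.lower().split()
    new_tokens = tokens[:pronoun_index] + candidate_tokens + tokens[pronoun_index + 1:]
    return new_tokens, pronoun_index + len(candidate_tokens)


def score(log_prob, tokens, start=0):
    """Sum log P(token | previous token) over tokens[start:]."""
    padded = ["<s>"] + tokens
    return sum(log_prob(padded[i], padded[i + 1]) for i in range(start, len(tokens)))


def resolve(log_prob, sentence, pronoun_index, candidates, partial=False):
    """Return the candidate whose substitution the language model scores highest."""
    def candidate_score(candidate):
        tokens, rest_start = substitute(sentence, pronoun_index, candidate)
        return score(log_prob, tokens, rest_start if partial else 0)

    return max(candidates, key=candidate_score)


# Hypothetical toy data; the real models are trained on corpora such as LM-1-Billion.
toy_corpus = [
    "the dog barks loudly",
    "the cat meows softly",
    "the dog barks",
    "the cat sleeps",
]
lm = train_bigram_lm(toy_corpus)
question = "the cat feared the dog because it barks loudly"
pronoun_position = 6  # index of "it" in the whitespace-tokenized question
choices = ["the dog", "the cat"]

full_answer = resolve(lm, question, pronoun_position, choices)
partial_answer = resolve(lm, question, pronoun_position, choices, partial=True)
print(full_answer, "|", partial_answer)
```

A bigram model sees only one word of context, so this toy example works only because the deciding word ("barks") directly follows the substituted candidate. Winograd schemas generally depend on longer-range context, which is one reason a recurrent language model is a more natural fit for the real task.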

Tasks

Common Sense Reasoning, Coreference Resolution, Diversity, Multiple-choice, Natural Language Understanding

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Winograd Schema Challenge	Ensemble of 14 LMs	Accuracy	63.7	—	Unverified
Winograd Schema Challenge	Word-level CNN+LSTM (partial scoring)	Accuracy	62.6	—	Unverified
Winograd Schema Challenge	Char-level CNN+LSTM (partial scoring)	Accuracy	57.9	—	Unverified
