Maximum Margin Reward Networks for Learning from Explicit and Implicit Supervision

2017-09-01 · EMNLP 2017

Haoruo Peng, Ming-Wei Chang, Wen-tau Yih

Abstract

Neural networks have achieved state-of-the-art performance on several structured-output prediction tasks when trained in a fully supervised fashion. However, annotated examples in structured domains are often costly to obtain, which limits the applicability of neural networks. In this work, we propose Maximum Margin Reward Networks, a neural network-based framework that aims to learn from both explicit supervision (full structures) and implicit supervision (delayed feedback on the correctness of the predicted structure). On named entity recognition and semantic parsing, our model outperforms previous systems on the benchmark datasets CoNLL-2003 and WebQuestionsSP, respectively.
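
The abstract does not spell out the training objective, so the following is only a rough sketch of how a margin-based loss driven by a reward signal might look, not the paper's formulation. It assumes a finite candidate set and uses a generic reward-scaled hinge loss; the names `margin_reward_loss`, `score`, and `reward` are hypothetical and chosen for illustration.

```python
from typing import Callable, Sequence, TypeVar

Y = TypeVar("Y")  # a candidate output structure (tag sequence, logical form, ...)


def margin_reward_loss(
    candidates: Sequence[Y],
    score: Callable[[Y], float],
    reward: Callable[[Y], float],
) -> float:
    """Generic reward-scaled hinge loss over a finite candidate set.

    Illustrative assumption, not the paper's objective: the highest-reward
    candidate should outscore every other candidate by at least the gap
    between their rewards.
    """
    if not candidates:
        raise ValueError("need at least one candidate structure")

    # Reference structure: the candidate with the highest reward.
    best = max(candidates, key=reward)
    best_score, best_reward = score(best), reward(best)

    # Largest margin violation; the required margin is the reward gap.
    worst_violation = max(
        (best_reward - reward(y)) + score(y) - best_score for y in candidates
    )
    return max(0.0, worst_violation)


# Toy usage with made-up scores and rewards.
toy_scores = {"A": 1.0, "B": 2.0, "C": 0.5}
toy_rewards = {"A": 1.0, "B": 0.0, "C": 0.5}
toy_loss = margin_reward_loss(list(toy_scores), toy_scores.get, toy_rewards.get)
```

Under this sketch, the two kinds of supervision differ only in how `reward` is computed. With explicit supervision it could compare a candidate against the annotated structure; with implicit supervision it could be delayed feedback, such as whether executing a predicted parse returns the correct answer.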
