A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks
Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka, Richard Socher
Code
- github.com/hassyGo/charNgram2vec (official; referenced in the paper)
- github.com/rubythonode/joint-many-task-model (TensorFlow)
Abstract
Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a single model. We introduce a joint many-task model together with a strategy for successively growing its depth to solve increasingly complex tasks. Higher layers include shortcut connections to lower-level task predictions to reflect linguistic hierarchies. We use a simple regularization term that allows optimizing all model weights to improve one task's loss without causing catastrophic interference with the other tasks. Our single end-to-end model obtains state-of-the-art or competitive results on five different tasks spanning tagging, parsing, relatedness, and entailment.
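The key training device mentioned in the abstract is a successive regularization term that anchors shared parameters to their values from the previous training stage. The sketch below illustrates that idea in PyTorch; it is a minimal sketch under stated assumptions, not the authors' released code, and the function name `successive_regularization`, the strength `delta`, and the stand-in encoder are illustrative only.

```python
import torch
import torch.nn as nn

def successive_regularization(params, prev_params, delta=1e-2):
    """delta * ||theta - theta_prev||^2 summed over shared parameters.

    Illustrative sketch: `delta` stands in for a tuned hyperparameter;
    the penalty keeps shared weights close to their pre-task snapshot,
    which is how catastrophic interference is mitigated.
    """
    penalty = torch.zeros(())
    for p, p_prev in zip(params, prev_params):
        penalty = penalty + ((p - p_prev) ** 2).sum()
    return delta * penalty

# Tiny runnable demo: a stand-in shared encoder trained on one task's loss
# while being anchored to its parameter snapshot from the previous stage.
shared = nn.Linear(8, 8)
snapshot = [p.detach().clone() for p in shared.parameters()]

x, y = torch.randn(4, 8), torch.randn(4, 8)
task_loss = nn.functional.mse_loss(shared(x), y)
loss = task_loss + successive_regularization(shared.parameters(), snapshot)
loss.backward()
```

In a full training loop, a fresh snapshot would be taken before each task's epoch, so every task is regularized toward the model state left by the previously trained task.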
Tasks
- Part-of-speech tagging
- Chunking
- Dependency parsing
- Semantic relatedness
- Textual entailment
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Penn Treebank (chunking) | JMT | F1 score | 95.77 | — | Unverified |