Robust Multilingual Part-of-Speech Tagging via Adversarial Training

2017-11-14NAACL 2018Code Available0· sign in to hype

Michihiro Yasunaga, Jungo Kasai, Dragomir Radev

Code Available — Be the first to reproduce this paper.

Code

github.com/michiyasunaga/pos_adv
Officialnone★ 0

Abstract

Adversarial training (AT) is a powerful regularization method for neural networks, aiming to achieve robustness to input perturbations. Yet, the specific effects of the robustness obtained from AT are still unclear in the context of natural language processing. In this paper, we propose and analyze a neural POS tagging model that exploits AT. In our experiments on the Penn Treebank WSJ corpus and the Universal Dependencies (UD) dataset (27 languages), we find that AT not only improves the overall tagging accuracy, but also 1) prevents over-fitting well in low resource languages and 2) boosts tagging accuracy for rare / unseen words. We also demonstrate that 3) the improved tagging performance by AT contributes to the downstream task of dependency parsing, and that 4) AT helps the model to learn cleaner word representations. 5) The proposed AT model is generally effective in different sequence labeling tasks. These positive results motivate further use of AT for natural language tasks.

Tasks

Chunking Dependency Parsing Named Entity Recognition (NER)Part-Of-Speech Tagging POS POS Tagging

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CoNLL-2000	Adversarial Training	Exact Span F1	95.25	—	Unverified
CoNLL-2000	BiLSTM-CRF	Exact Span F1	95.18	—	Unverified

Robust Multilingual Part-of-Speech Tagging via Adversarial Training

Code

Abstract

Tasks

Benchmark Results

Reproductions