Unsupervised Chunking as Syntactic Structure Induction with a Knowledge-Transfer Approach
2021-11-01Findings (EMNLP) 2021Code Available0· sign in to hype
Anup Anand Deshmukh, Qianqiu Zhang, Ming Li, Jimmy Lin, Lili Mou
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/anup-deshmukh/unsupervised-chunkingOfficialIn paperpytorch★ 4
Abstract
In this paper, we address unsupervised chunking as a new task of syntactic structure induction, which is helpful for understanding the linguistic structures of human languages as well as processing low-resource languages. We propose a knowledge-transfer approach that heuristically induces chunk labels from state-of-the-art unsupervised parsing models; a hierarchical recurrent neural network (HRNN) learns from such induced chunk labels to smooth out the noise of the heuristics. Experiments show that our approach largely bridges the gap between supervised and unsupervised chunking.