SOTAVerified

Neural Regularized Domain Adaptation for Chinese Word Segmentation

2017-12-01WS 2017Unverified0· sign in to hype

Zuyi Bao, Si Li, Weiran Xu, Sheng Gao

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

For Chinese word segmentation, the large-scale annotated corpora mainly focus on newswire and only a handful of annotated data is available in other domains such as patents and literature. Considering the limited amount of annotated target domain data, it is a challenge for segmenters to learn domain-specific information while avoid getting over-fitted at the same time. In this paper, we propose a neural regularized domain adaptation method for Chinese word segmentation. The teacher networks trained in source domain are employed to regularize the training process of the student network by preserving the general knowledge. In the experiments, our neural regularized domain adaptation method achieves a better performance comparing to previous methods.

Tasks

Reproductions