Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble

2020-11-01EMNLP 2020Code Available1· sign in to hype

Peerat Limkonchotiwat, Wannaphong Phatthiyaphaibun, Raheem Sarwar, Ekapol Chuangsuwanich, Sarana Nutanong

Code Available — Be the first to reproduce this paper.

Code

github.com/mrpeerat/SEFR_CUT
Officialtf★ 20

Abstract

Like many Natural Language Processing tasks, Thai word segmentation is domain-dependent. Researchers have been relying on transfer learning to adapt an existing model to a new domain. However, this approach is inapplicable to cases where we can interact with only input and output layers of the models, also known as ``black boxes''. We propose a filter-and-refine solution based on the stacked-ensemble learning paradigm to address this black-box limitation. We conducted extensive experimental studies comparing our method against state-of-the-art models and transfer learning. Experimental results show that our proposed solution is an effective domain adaptation method and has a similar performance as the transfer learning method.

Tasks

Domain Adaptation Ensemble Learning Thai Word Segmentation Transfer Learning

Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble

Code

Abstract

Tasks

Reproductions