Application of Mix-Up Method in Document Classification Task Using BERT

2021-09-01RANLP 2021Unverified0· sign in to hype

Naoki Kikuta, Hiroyuki Shinnou

Unverified — Be the first to reproduce this paper.

Abstract

The mix-up method (Zhang et al., 2017), one of the methods for data augmentation, is known to be easy to implement and highly effective. Although the mix-up method is intended for image identification, it can also be applied to natural language processing. In this paper, we attempt to apply the mix-up method to a document classification task using bidirectional encoder representations from transformers (BERT) (Devlin et al., 2018). Since BERT allows for two-sentence input, we concatenated word sequences from two documents with different labels and used the multi-class output as the supervised data with a one-hot vector. In an experiment using the livedoor news corpus, which is Japanese, we compared the accuracy of document classification using two methods for selecting documents to be concatenated with that of ordinary document classification. As a result, we found that the proposed method is better than the normal classification when the documents with labels shortages are mixed preferentially. This indicates that how to choose documents for mix-up has a significant impact on the results.

Tasks

Classification Data Augmentation Document Classification Sentence

Application of Mix-Up Method in Document Classification Task Using BERT

Abstract

Tasks

Reproductions