SOTAVerified

A Semi-Supervised Approach for Low-Resourced Text Generation

2019-06-03Code Available0· sign in to hype

Hongyu Zang, Xiaojun Wan

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Recently, encoder-decoder neural models have achieved great success on text generation tasks. However, one problem of this kind of models is that their performances are usually limited by the scale of well-labeled data, which are very expensive to get. The low-resource (of labeled data) problem is quite common in different task generation tasks, but unlabeled data are usually abundant. In this paper, we propose a method to make use of the unlabeled data to improve the performance of such models in the low-resourced circumstances. We use denoising auto-encoder (DAE) and language model (LM) based reinforcement learning (RL) to enhance the training of encoder and decoder with unlabeled data. Our method shows adaptability for different text generation tasks, and makes significant improvements over basic text generation models.

Tasks

Reproductions