Learning to Generate Examples for Semantic Processing Tasks

2022-07-01NAACL 2022Unverified0· sign in to hype

Danilo Croce, Simone Filice, Giuseppe Castellucci, Roberto Basili

Unverified — Be the first to reproduce this paper.

Abstract

Even if recent Transformer-based architectures, such as BERT, achieved impressive results in semantic processing tasks, their fine-tuning stage still requires large scale training resources. Usually, Data Augmentation (DA) techniques can help to deal with low resource settings. In Text Classification tasks, the objective of DA is the generation of well-formed sentences that i) represent the desired task category and ii) are novel with respect to existing sentences. In this paper, we propose a neural approach to automatically learn to generate new examples using a pre-trained sequence-to-sequence model. We first learn a task-oriented similarity function that we use to pair similar examples. Then, we use these example pairs to train a model to generate examples. Experiments in low resource settings show that augmenting the training material with the proposed strategy systematically improves the results on text classification and natural language inference tasks by up to 10% accuracy, outperforming existing DA approaches.

Tasks

Data Augmentation Natural Language Inference text-classification Text Classification

Learning to Generate Examples for Semantic Processing Tasks

Abstract

Tasks

Reproductions