Attribute Injection for Pretrained Language Models: A New Benchmark and an Efficient Method

2022-10-01 · COLING 2022 · Code Available

Reinald Kim Amplayo, Kang Min Yoo, Sang-Woo Lee


Code

github.com/rktamplayo/injector
Official implementation, linked in the paper

Abstract

Metadata attributes (e.g., user and product IDs from reviews) can be incorporated as additional inputs to neural NLP models by expanding the model architecture, which improves performance. However, recent models rely on pretrained language models (PLMs), for which previously used attribute injection techniques are either nontrivial to apply or cost-ineffective. In this paper, we introduce a benchmark for evaluating attribute injection models, comprising eight datasets across a diverse range of tasks and domains, plus six synthetically sparsified ones. We also propose a lightweight and memory-efficient method to inject attributes into PLMs. We extend adapters, i.e., tiny plug-in feed-forward modules, to include attributes either independently of or jointly with the text. We use approximation techniques to parameterize the model efficiently for domains with large attribute vocabularies, and training mechanisms to handle multi-labeled and sparse attributes. Extensive experiments and analyses show that our method outperforms previous attribute injection methods and achieves state-of-the-art performance on all datasets.
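
To make the adapter idea concrete, here is a minimal pure-Python sketch of a bottleneck adapter (down-projection, nonlinearity, up-projection, residual connection) that adds an attribute-specific bias inside the bottleneck. It is not the authors' implementation: the class name `AttributeAdapter`, the bias-only form of injection, the initialization scale, and all dimensions are assumptions chosen for illustration. It covers only the simplest case, in which the attribute shifts the adapter independently of the text. It omits the joint text-attribute variant, the approximation techniques for large attribute vocabularies, and the training mechanisms for multi-labeled and sparse attributes.

```python
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def relu(v):
    return [max(0.0, a) for a in v]


class AttributeAdapter:
    """Hypothetical bottleneck adapter with a per-attribute bias (illustrative only)."""

    def __init__(self, hidden, bottleneck, num_attrs, seed=0):
        rng = random.Random(seed)

        def init(rows, cols):
            return [[rng.gauss(0.0, 0.02) for _ in range(cols)] for _ in range(rows)]

        self.down = init(bottleneck, hidden)  # hidden -> bottleneck
        self.up = init(hidden, bottleneck)    # bottleneck -> hidden
        # Assumption: one bias vector per attribute ID, zero-initialized so the
        # module starts out behaving like an attribute-agnostic adapter.
        self.attr_bias = [[0.0] * bottleneck for _ in range(num_attrs)]

    def forward(self, h, attr_id):
        z = matvec(self.down, h)
        z = [a + b for a, b in zip(z, self.attr_bias[attr_id])]  # inject attribute
        z = relu(z)
        out = matvec(self.up, z)
        return [hi + oi for hi, oi in zip(h, out)]  # residual connection


adapter = AttributeAdapter(hidden=8, bottleneck=2, num_attrs=3)
h = [0.1 * i for i in range(8)]          # stand-in for one PLM hidden state
out_a = adapter.forward(h, attr_id=0)    # attribute 0: zero bias
adapter.attr_bias[1] = [1.0, 1.0]        # pretend attribute 1 has a learned bias
out_b = adapter.forward(h, attr_id=1)    # same text state, different attribute
```

In this sketch the only attribute-specific parameters are the bottleneck-sized bias vectors, so the per-attribute cost stays small relative to the frozen PLM. Whether the paper's method uses this exact form is not stated in the abstract.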

Tasks

Attribute
