
Challenges in including extra-linguistic context in pre-trained language models

2022-05-01 · insights (ACL) 2022

Ionut Sorodoc, Laura Aina, Gemma Boleda

Abstract

To successfully account for language, computational models need to take into account both the linguistic context (the content of the utterances) and the extra-linguistic context (for instance, the participants in a dialogue). We focus on a referential task that asks models to link entity mentions in a TV show to the corresponding characters, and design an architecture that attempts to account for both kinds of context. In particular, our architecture combines a previously proposed specialized module (an “entity library”) for character representation with transfer learning from a pre-trained language model. We find that, although the model does improve linguistic contextualization, it fails to successfully integrate extra-linguistic information about the participants in the dialogue. Our work shows that it is very challenging to incorporate extra-linguistic information into pre-trained language models.
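The sketch below is a minimal illustration of the kind of architecture the abstract describes: a pre-trained language model supplies contextualized mention representations, and a learned character-embedding table plays the role of the "entity library" against which mentions are scored. It assumes a PyTorch/Hugging Face setup; the class name, parameter names, and the dot-product scoring scheme are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch, not the authors' code: linking entity mentions in
# dialogue to characters by combining a pre-trained LM (linguistic context)
# with a learned "entity library" (extra-linguistic character representations).
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class EntityLibraryLinker(nn.Module):
    def __init__(self, num_characters: int, model_name: str = "bert-base-uncased"):
        super().__init__()
        # Pre-trained encoder provides linguistic contextualization.
        self.encoder = AutoModel.from_pretrained(model_name)
        hidden = self.encoder.config.hidden_size
        # "Entity library": one persistent learned vector per character.
        self.entity_library = nn.Embedding(num_characters, hidden)

    def forward(self, input_ids, attention_mask, mention_index):
        # Contextualized token representations: (batch, seq_len, hidden).
        hidden_states = self.encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).last_hidden_state
        # Pull out the representation of the mention token (e.g. a pronoun).
        batch_idx = torch.arange(input_ids.size(0))
        mention_vec = hidden_states[batch_idx, mention_index]  # (batch, hidden)
        # Score each character by similarity to its library entry.
        logits = mention_vec @ self.entity_library.weight.T    # (batch, num_chars)
        return logits

# Usage example (mention position and character inventory are made up):
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = EntityLibraryLinker(num_characters=6)
batch = tokenizer(["Joey said he would call Monica later."], return_tensors="pt")
logits = model(batch["input_ids"], batch["attention_mask"],
               mention_index=torch.tensor([3]))  # token position of "he"
predicted_character = logits.argmax(dim=-1)
```

The design point this sketch tries to capture is that the entity library persists across utterances, so character representations can in principle accumulate extra-linguistic information, whereas the encoder only sees the current linguistic context; the paper's finding is that getting the pre-trained model to actually exploit the former is what proves difficult.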
