Clarifying Implicit and Underspecified Phrases in Instructional Text

2022-06-01LREC 2022Unverified0· sign in to hype

Talita Anthonio, Anna Sauer, Michael Roth

Unverified — Be the first to reproduce this paper.

Abstract

Natural language inherently consists of implicit and underspecified phrases, which represent potential sources of misunderstanding. In this paper, we present a data set of such phrases in English from instructional texts together with multiple possible clarifications. Our data set, henceforth called CLAIRE, is based on a corpus of revision histories from wikiHow, from which we extract human clarifications that resolve an implicit or underspecified phrase. We show how language modeling can be used to generate alternate clarifications, which may or may not be compatible with the human clarification. Based on plausibility judgements for each clarification, we define the task of distinguishing between plausible and implausible clarifications. We provide several baseline models for this task and analyze to what extent different clarifications represent multiple readings as a first step to investigate misunderstandings caused by implicit/underspecified language in instructional texts.

Tasks

Language Modeling Language Modelling

Clarifying Implicit and Underspecified Phrases in Instructional Text

Abstract

Tasks

Reproductions