Grammatical Analysis of Pretrained Sentence Encoders with Acceptability Judgments

2018-12-11

Anonymous

Abstract

Recent pretrained sentence encoders achieve state-of-the-art results on language understanding tasks, but does this mean they have implicit knowledge of syntactic structures? We introduce a grammatically annotated development set for the Corpus of Linguistic Acceptability (CoLA; Warstadt et al., 2018), which we use to investigate the grammatical knowledge of three pretrained encoders, including the popular OpenAI Transformer (Radford et al., 2018) and BERT (Devlin et al., 2018). We fine-tune these encoders to do acceptability classification over CoLA and compare the models’ performance on the annotated analysis set. Some phenomena, e.g., modification by adjuncts, are easy to learn for all models; others, e.g., long-distance movement, are learned effectively only by models with strong overall performance; and still others, e.g., morphological agreement, are hardly learned by any model.
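
The per-phenomenon comparison described in the abstract can be illustrated with a minimal sketch. It assumes each analysis-set sentence carries a binary acceptability label and a set of grammatical tags, and it scores predictions with the Matthews correlation coefficient (MCC), the metric commonly reported for CoLA. The function names, tag names, and toy data are hypothetical and do not come from the paper, and whether the authors compute their per-phenomenon scores exactly this way is an assumption.

```python
from collections import defaultdict
from math import sqrt


def mcc(gold, pred):
    """Matthews correlation coefficient for binary labels (1 = acceptable)."""
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    tn = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 0)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Convention: return 0.0 when the denominator vanishes
    # (e.g. the model predicts a single class for every sentence).
    return (tp * tn - fp * fn) / denom if denom else 0.0


def per_phenomenon_mcc(examples, predictions):
    """Group sentences by grammatical tag and score each group separately.

    examples: list of (label, tags) pairs, where tags is a set of strings.
    predictions: list of predicted labels, aligned with examples.
    A sentence with several tags contributes to every one of its groups.
    """
    by_tag = defaultdict(lambda: ([], []))
    for (label, tags), pred in zip(examples, predictions):
        for tag in tags:
            by_tag[tag][0].append(label)
            by_tag[tag][1].append(pred)
    return {tag: mcc(gold, pred) for tag, (gold, pred) in by_tag.items()}


# Hypothetical toy data: tag names are illustrative only.
examples = [
    (1, {"adjunct"}),
    (0, {"adjunct"}),
    (1, {"agreement"}),
    (0, {"agreement"}),
    (1, {"adjunct", "agreement"}),
]
predictions = [1, 0, 1, 1, 1]

scores = per_phenomenon_mcc(examples, predictions)
for tag in sorted(scores):
    print(tag, scores[tag])
```

In this toy run the classifier separates acceptable from unacceptable sentences within the hypothetical "adjunct" group but labels every "agreement" sentence acceptable, so the second group scores zero even though most of its individual predictions are correct. That is why a correlation-style metric is more informative than raw accuracy for this kind of breakdown.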

Tasks

CoLA, Linguistic Acceptability, Sentence
