Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models

2021-05-07NAACL 2021Code Available0· sign in to hype

Anne Beyer, Sharid Loáiciga, David Schlangen

Code Available — Be the first to reproduce this paper.

Code

github.com/AnneBeyer/coherencegym
OfficialIn paperpytorch★ 3

Abstract

Coherent discourse is distinguished from a mere collection of utterances by the satisfaction of a diverse set of constraints, for example choice of expression, logical relation between denoted events, and implicit compatibility with world-knowledge. Do neural language models encode such constraints? We design an extendable set of test suites addressing different aspects of discourse and dialogue coherence. Unlike most previous coherence evaluation studies, we address specific linguistic devices beyond sentence order perturbations, allowing for a more fine-grained analysis of what constitutes coherence and what neural models trained on a language modelling objective do encode. Extending the targeted evaluation paradigm for neural language models (Marvin and Linzen, 2018) to phenomena beyond syntax, we show that this paradigm is equally suited to evaluate linguistic qualities that contribute to the notion of coherence.

Tasks

Coherence Evaluation Language Modelling Sentence World Knowledge

Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models

Code

Abstract

Tasks

Reproductions