SOTAVerified

Expletives in Universal Dependency Treebanks

2018-11-01WS 2018Code Available0· sign in to hype

Gosse Bouma, Jan Hajic, Dag Haug, Joakim Nivre, Per Erik Solberg, Lilja {\O}vrelid

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Although treebanks annotated according to the guidelines of Universal Dependencies (UD) now exist for many languages, the goal of annotating the same phenomena in a cross-linguistically consistent fashion is not always met. In this paper, we investigate one phenomenon where we believe such consistency is lacking, namely expletive elements. Such elements occupy a position that is structurally associated with a core argument (or sometimes an oblique dependent), yet are non-referential and semantically void. Many UD treebanks identify at least some elements as expletive, but the range of phenomena differs between treebanks, even for closely related languages, and sometimes even for different treebanks for the same language. In this paper, we present criteria for identifying expletives that are applicable across languages and compatible with the goals of UD, give an overview of expletives as found in current UD treebanks, and present recommendations for the annotation of expletives so that more consistent annotation can be achieved in future releases.

Tasks

Reproductions