SOTAVerified

A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds

2016-05-01LREC 2016Unverified0· sign in to hype

Andrea Horbach, Andrea Hensler, Sabine Krome, Jakob Prange, Werner Scholze-Stubenrecht, Diana Steffen, Stefan Thater, Christian Wellner, Manfred Pinkal

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We present an annotation study on a representative dataset of literal and idiomatic uses of German infinitive-verb compounds in newspaper and journal texts. Infinitive-verb compounds form a challenge for writers of German, because spelling regulations are different for literal and idiomatic uses. Through the participation of expert lexicographers we were able to obtain a high-quality corpus resource which offers itself as a testbed for automatic idiomaticity detection and coarse-grained word-sense disambiguation. We trained a classifier on the corpus which was able to distinguish literal and idiomatic uses with an accuracy of 85 \%.

Tasks

Reproductions