Light Verb Constructions in the SzegedParalellFX English–Hungarian Parallel Corpus

2012-05-01 · LREC 2012

Veronika Vincze

Abstract

In this paper, we describe the first English-Hungarian parallel corpus annotated for light verb constructions, which contains 14,261 sentence alignment units. Annotation principles and statistical data on the corpus are also provided, and English and Hungarian data are contrasted. On the basis of corpus data, a database containing pairs of English-Hungarian light verb constructions has been created as well. The corpus and the database can contribute to the automatic detection of light verb constructions, and we also show how they can enhance performance in several fields of NLP (e.g., parsing, information extraction/retrieval, and machine translation).
