SOTAVerified

A Preliminary Study of Croatian Lexical Substitution

2017-04-01WS 2017Unverified0· sign in to hype

Domagoj Alagi{\'c}, Jan {\v{S}}najder

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Lexical substitution is a task of determining a meaning-preserving replacement for a word in context. We report on a preliminary study of this task for the Croatian language on a small-scale lexical sample dataset, manually annotated using three different annotation schemes. We compare the annotations, analyze the inter-annotator agreement, and observe a number of interesting language specific details in the obtained lexical substitutes. Furthermore, we apply a recently-proposed, dependency-based lexical substitution model to our dataset. The model achieves a P@3 score of 0.35, which indicates the difficulty of the task.

Tasks

Reproductions