SOTAVerified

Lemmatization of Multi-word Common Noun Phrases and Named Entities in Polish

2017-09-01RANLP 2017Unverified0· sign in to hype

Micha{\l} Marci{\'n}czuk

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

In the paper we present a tool for lemmatization of multi-word common noun phrases and named entities for Polish called LemmaPL. The tool is based on a set of manually crafted rules and heuristics utilizing a set of dictionaries (including morphological, named entities and inflection patterns). The accuracy of lemmatization obtained by the tool reached 97.99\% on a dataset with multi-word common noun phrases and 86.17\% for case-sensitive evaluation on a dataset with named entities.

Tasks

Reproductions