Turkish Treebank as a Gold Standard for Morphological Disambiguation and Its Influence on Parsing

2014-05-01 · LREC 2014

Özlem Çetinoğlu


Abstract

So far, predicted scenarios for Turkish dependency parsing have used a morphological disambiguator that is trained on the data distributed with the tool (Sak et al., 2008). Although models trained on this data have high accuracy scores on the test and development data of the same set, the accuracy drastically drops when the model is used in the preprocessing of Turkish Treebank parsing experiments. We propose to use the Turkish Treebank (Oflazer et al., 2003) as a morphological resource to overcome this problem and convert the treebank to the morphological disambiguator's format. The experimental results show that we achieve improvements in disambiguating the Turkish Treebank and the results also carry over to parsing. With the help of better morphological analysis, we present the best labelled dependency parsing scores to date on Turkish.
