SOTAVerified

TurkuNLP: Delexicalized Pre-training of Word Embeddings for Dependency Parsing

2017-08-01CONLL 2017Unverified0· sign in to hype

Jenna Kanerva, Juhani Luotolahti, Filip Ginter

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

We present the TurkuNLP entry in the CoNLL 2017 Shared Task on Multilingual Parsing from Raw Text to Universal Dependencies. The system is based on the UDPipe parser with our focus being in exploring various techniques to pre-train the word embeddings used by the parser in order to improve its performance especially on languages with small training sets. The system ranked 11th among the 33 participants overall, being 8th on the small treebanks, 10th on the large treebanks, 12th on the parallel test sets, and 26th on the surprise languages.

Tasks

Reproductions