TurkuNLP: Delexicalized Pre-training of Word Embeddings for Dependency Parsing
2017-08-01CONLL 2017Unverified0· sign in to hype
Jenna Kanerva, Juhani Luotolahti, Filip Ginter
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
We present the TurkuNLP entry in the CoNLL 2017 Shared Task on Multilingual Parsing from Raw Text to Universal Dependencies. The system is based on the UDPipe parser with our focus being in exploring various techniques to pre-train the word embeddings used by the parser in order to improve its performance especially on languages with small training sets. The system ranked 11th among the 33 participants overall, being 8th on the small treebanks, 10th on the large treebanks, 12th on the parallel test sets, and 26th on the surprise languages.