SOTAVerified

Multilingual Sense Intersection in a Parallel Corpus with Diverse Language Families

2016-01-01GWC 2016Unverified0· sign in to hype

Giulia Bonansinga, Francis Bond

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Supervised methods for Word Sense Disambiguation (WSD) benefit from high-quality sense-annotated resources, which are lacking for many languages less common than English. There are, however, several multilingual parallel corpora that can be inexpensively annotated with senses through cross-lingual methods. We test the effectiveness of such an approach by attempting to disambiguate English texts through their translations in Italian, Romanian and Japanese. Specifically, we try to find the appropriate word senses for the English words by comparison with all the word senses associated to their translations. The main advantage of this approach is in that it can be applied to any parallel corpus, as long as large, high-quality inter-linked sense inventories exist for all the languages considered.

Tasks

Reproductions