SOTAVerified

Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction

2020-07-01ACL 2020Unverified0· sign in to hype

Mladen Karan, Ivan Vuli{\'c}, Anna Korhonen, Goran Glava{\v{s}}

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Effective projection-based cross-lingual word embedding (CLWE) induction critically relies on the iterative self-learning procedure. It gradually expands the initial small seed dictionary to learn improved cross-lingual mappings. In this work, we present ClassyMap, a classification-based approach to self-learning, yielding a more robust and a more effective induction of projection-based CLWEs. Unlike prior self-learning methods, our approach allows for integration of diverse features into the iterative process. We show the benefits of ClassyMap for bilingual lexicon induction: we report consistent improvements in a weakly supervised setup (500 seed translation pairs) on a benchmark with 28 language pairs.

Tasks

Reproductions