Gender-preserving Debiasing for Pre-trained Word Embeddings

2019-06-03 · ACL 2019

Masahiro Kaneko, Danushka Bollegala

Code

github.com/kanekomasahiro/gp_debias
Official · In paper · PyTorch

Abstract

Word embeddings learnt from massive text collections have demonstrated significant levels of discriminative biases such as gender, racial or ethnic biases, which in turn bias the downstream NLP applications that use those word embeddings. Taking gender bias as a working example, we propose a debiasing method that preserves non-discriminative gender-related information while removing stereotypical discriminative gender biases from pre-trained word embeddings. Specifically, we consider four types of information: feminine, masculine, gender-neutral and stereotypical, which represent the relationship between gender and bias, and propose a debiasing method that (a) preserves the gender-related information in feminine and masculine words, (b) preserves the neutrality in gender-neutral words, and (c) removes the biases from stereotypical words. Experimental results on several previously proposed benchmark datasets show that our proposed method can debias pre-trained word embeddings better than existing state-of-the-art (SoTA) methods proposed for debiasing word embeddings while preserving gender-related but non-discriminative information.
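The three goals (a) to (c) can be illustrated with a toy sketch. This is not the method from the paper or the code in the linked repository: it swaps in a plain linear projection onto a single gender direction, and every name in it (`gender_direction`, `remove_gender_component`, `debias`, the four-word vocabulary and its 2-d vectors) is a hypothetical chosen for illustration only.

```python
from math import sqrt

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def gender_direction(pairs, emb):
    """Unit vector along the summed (feminine - masculine) differences.

    Assumption: one linear direction captures gender in this toy space.
    """
    dim = len(next(iter(emb.values())))
    d = [0.0] * dim
    for fem, masc in pairs:
        for i in range(dim):
            d[i] += emb[fem][i] - emb[masc][i]
    norm = sqrt(dot(d, d))
    return [x / norm for x in d]

def remove_gender_component(vec, direction):
    """Subtract the projection of vec onto the (unit) gender direction."""
    p = dot(vec, direction)
    return [x - p * g for x, g in zip(vec, direction)]

def debias(emb, feminine, masculine, direction):
    """(a) keep feminine/masculine vectors unchanged;
    (b), (c) project every other word (gender-neutral or stereotypical)
    off the gender direction."""
    keep = set(feminine) | set(masculine)
    return {
        w: list(v) if w in keep else remove_gender_component(v, direction)
        for w, v in emb.items()
    }

# Hypothetical 2-d embeddings: axis 0 plays the role of "gender".
emb = {
    "she": [1.0, 0.0],     # feminine
    "he": [-1.0, 0.0],     # masculine
    "nurse": [0.6, 0.8],   # stereotypical (leans feminine in this toy data)
    "table": [0.0, 1.0],   # gender-neutral
}
direction = gender_direction([("she", "he")], emb)
debiased = debias(emb, feminine=["she"], masculine=["he"], direction=direction)
```

In this toy setup the gendered words keep their vectors, the neutral word is unaffected because it already has no gender component, and the stereotypical word loses only its component along the gender direction. A projection treats neutral and stereotypical words identically, which is one reason a method that distinguishes the four word types, as the abstract describes, would need more than this.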

Tasks

Word Embeddings
