SOTAVerified

Fine-tuning Neural Machine Translation on Gender-Balanced Datasets

2020-12-01GeBNLP (COLING) 2020Unverified0· sign in to hype

Marta R. Costa-jussà, Adrià de Jorge

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Misrepresentation of certain communities in datasets is causing big disruptions in artificial intelligence applications. In this paper, we propose using an automatically extracted gender-balanced dataset parallel corpus from Wikipedia. This balanced set is used to perform fine-tuning techniques from a bigger model trained on unbalanced datasets to mitigate gender biases in neural machine translation.

Tasks

Reproductions