
Exploring Cross-lingual Text Detoxification with Large Multilingual Language Models

2022-05-01 · ACL 2022 · Code Available

Daniil Moskovskiy, Daryna Dementieva, Alexander Panchenko

Abstract

Detoxification is the task of rewriting toxic text in a polite style while preserving the meaning and fluency of the original. Existing detoxification methods are monolingual, i.e., designed to work in a single language. This work investigates multilingual and cross-lingual detoxification and the behavior of large multilingual models in this setting. Unlike previous work, we aim to enable large language models to perform detoxification without direct fine-tuning in a given language. Experiments show that multilingual models are capable of multilingual style transfer. However, the tested state-of-the-art models are unable to perform cross-lingual detoxification: direct fine-tuning in the target language currently remains unavoidable, motivating further research in this direction.
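
The setup the abstract describes (fine-tune a multilingual sequence-to-sequence model on parallel toxic/neutral pairs in one language, then check whether detoxification transfers to another language) can be sketched as follows. This is a minimal illustration under assumptions, not the authors' exact code: the model choice (mT5-small), the toy training pairs, and the hyperparameters are all placeholders.

```python
# Hypothetical sketch (not the paper's exact code): fine-tune a multilingual
# seq2seq model on toxic -> neutral pairs in one language, then probe
# cross-lingual transfer on a language unseen during fine-tuning.
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

MODEL_NAME = "google/mt5-small"  # assumption: any multilingual seq2seq model
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_NAME)

# Toy parallel detoxification data (toxic source, neutral target) in English.
train_pairs = [
    ("this is a damn stupid idea", "this idea does not seem good"),
    ("shut up, you know nothing", "please stop, I disagree with you"),
]

optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)
model.train()
for _epoch in range(3):
    for toxic, neutral in train_pairs:
        batch = tokenizer(toxic, return_tensors="pt")
        labels = tokenizer(neutral, return_tensors="pt").input_ids
        loss = model(**batch, labels=labels).loss  # standard seq2seq cross-entropy
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

# Cross-lingual probe: a toxic sentence in a language the model was NOT
# fine-tuned on; the paper's finding is that such transfer largely fails.
model.eval()
russian_toxic = "это чертовски глупая идея"  # "this is a damn stupid idea"
inputs = tokenizer(russian_toxic, return_tensors="pt")
with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

In an evaluation along these lines, the generated output would be scored for toxicity, meaning preservation, and fluency, mirroring the three criteria the abstract names.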
