Mass-Editing Memory in a Transformer

2022-10-13 · Code available

Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, David Bau

Code

github.com/kmeng01/memit
Official · PyTorch · ★ 544
github.com/orange-opensource/wikifactdiff
PyTorch · ★ 13

Abstract

Recent work has shown exciting promise in updating large language models with new memories, so as to replace obsolete information or add specialized knowledge. However, this line of work is predominantly limited to updating single associations. We develop MEMIT, a method for directly updating a language model with many memories, demonstrating experimentally that it can scale up to thousands of associations for GPT-J (6B) and GPT-NeoX (20B), exceeding prior work by orders of magnitude. Our code and data are at https://memit.baulab.info.

Tasks

Language Modeling
