Recurrent Independent Mechanisms

2019-09-24ICLR 2021Code Available0· sign in to hype

Anirudh Goyal, Alex Lamb, Jordan Hoffmann, Shagun Sodhani, Sergey Levine, Yoshua Bengio, Bernhard Schölkopf

Code Available — Be the first to reproduce this paper.

Code

github.com/fuyuan-li/tensorflow-RIMs
tf★ 0
github.com/dido1998/Recurrent-Independent-Mechanisms
pytorch★ 0
github.com/anirudh9119/RIMs
pytorch★ 0

Abstract

Learning modular structures which reflect the dynamics of the environment can lead to better generalization and robustness to changes which only affect a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the bottleneck of attention, and are only updated at time steps where they are most relevant. We show that this leads to specialization amongst the RIMs, which in turn allows for dramatically improved generalization on tasks where some factors of variation differ systematically between training and evaluation.

Tasks

Atari Games

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Atari 2600 Beam Rider	RIMs-PPO	Score	5,320	—	Unverified
Atari 2600 Up and Down	RIMs-PPO	Score	390,000	—	Unverified
Atari 2600 Zaxxon	RIMs-PPO	Score	15,000	—	Unverified

Recurrent Independent Mechanisms

Code

Abstract

Tasks

Benchmark Results

Reproductions