Adaptive Rational Activations to Boost Deep Reinforcement Learning
Quentin Delfosse, Patrick Schramowski, Martin Mundt, Alejandro Molina, Kristian Kersting
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/ml-research/rational_activationsOfficialIn paperpytorch★ 105
- github.com/ml-research/rational_rlOfficialIn paperpytorch★ 11
- github.com/k4ntz/activation-functionsOfficialIn paperpytorch★ 3
- github.com/ml-research/rational_slOfficialIn paperpytorch★ 2
Abstract
Latest insights from biology show that intelligence not only emerges from the connections between neurons but that individual neurons shoulder more computational responsibility than previously anticipated. This perspective should be critical in the context of constantly changing distinct reinforcement learning environments, yet current approaches still primarily employ static activation functions. In this work, we motivate why rationals are suitable for adaptable activation functions and why their inclusion into neural networks is crucial. Inspired by recurrence in residual networks, we derive a condition under which rational units are closed under residual connections and formulate a naturally regularised version: the recurrent-rational. We demonstrate that equipping popular algorithms with (recurrent-)rational activations leads to consistent improvements on Atari games, especially turning simple DQN into a solid approach, competitive to DDQN and Rainbow.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Atari 2600 Asterix | Rational DQN Average | Score | 18,109 | — | Unverified |
| Atari 2600 Asterix | Recurrent Rational DQN Average | Score | 12,621 | — | Unverified |
| Atari 2600 Battle Zone | Recurrent Rational DQN Average | Score | 25,749 | — | Unverified |
| Atari 2600 Battle Zone | Rational DQN Average | Score | 23,403 | — | Unverified |
| Atari 2600 Breakout | Recurrent Rational DQN Average | Score | 336 | — | Unverified |
| Atari 2600 Breakout | Rational DQN Average | Score | 316 | — | Unverified |
| Atari 2600 Enduro | Rational DQN Average | Score | 1,043 | — | Unverified |
| Atari 2600 Enduro | Recurrent Rational DQN Average | Score | 957 | — | Unverified |
| Atari 2600 James Bond | Rational DQN Average | Score | 1,122 | — | Unverified |
| Atari 2600 James Bond | Recurrent Rational DQN Average | Score | 1,137 | — | Unverified |
| Atari 2600 Kangaroo | Recurrent Rational DQN Average | Score | 5,266 | — | Unverified |
| Atari 2600 Kangaroo | Rational DQN Average | Score | 2,941 | — | Unverified |
| Atari 2600 Pong | Recurrent Rational DQN Average | Score | 18.13 | — | Unverified |
| Atari 2600 Pong | Rational DQN Average | Score | 18.04 | — | Unverified |
| Atari 2600 Q*Bert | Recurrent Rational DQN Average | Score | 14,080 | — | Unverified |
| Atari 2600 Q*Bert | Rational DQN Average | Score | 14,436 | — | Unverified |
| Atari 2600 Seaquest | Rational DQN Average | Score | 6,603 | — | Unverified |
| Atari 2600 Seaquest | Recurrent Rational DQN Average | Score | 7,460 | — | Unverified |
| Atari 2600 Skiing | Recurrent Rational DQN Average | Score | -23,582 | — | Unverified |
| Atari 2600 Skiing | Rational DQN Average | Score | -23,487 | — | Unverified |
| Atari 2600 Space Invaders | Rational DQN Average | Score | 650 | — | Unverified |
| Atari 2600 Space Invaders | Recurrent Rational DQN Average | Score | 1,395 | — | Unverified |
| Atari 2600 Tennis | Rational DQN Average | Score | 20.5 | — | Unverified |
| Atari 2600 Tennis | Recurrent Rational DQN Average | Score | 20.6 | — | Unverified |
| Atari 2600 Time Pilot | Recurrent Rational DQN Average | Score | 13,261 | — | Unverified |
| Atari 2600 Time Pilot | Rational DQN Average | Score | 17,632 | — | Unverified |
| Atari 2600 Tutankham | Rational DQN Average | Score | 179 | — | Unverified |
| Atari 2600 Tutankham | Recurrent Rational DQN Average | Score | 184 | — | Unverified |
| Atari 2600 Video Pinball | Rational DQN Average | Score | 149,712 | — | Unverified |
| Atari 2600 Video Pinball | Recurrent Rational DQN Average | Score | 86,942 | — | Unverified |