| Learning to Locomote: Understanding How Environment Design Matters for Deep Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models | Oct 9, 2020 | Deep Reinforcement Learning, Epidemiology | Code Available | 1 |
| Learning Intrinsic Symbolic Rewards in Reinforcement Learning | Oct 8, 2020 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Guided Curriculum Learning for Walking Over Complex Terrain | Oct 8, 2020 | Deep Reinforcement Learning | Unverified | 0 |
| Prioritized Level Replay | Oct 8, 2020 | Deep Reinforcement Learning, Systematic Generalization | Code Available | 1 |
| Information-Driven Adaptive Sensing Based on Deep Reinforcement Learning | Oct 8, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Actor-Critic Algorithm for High-dimensional Partial Differential Equations | Oct 7, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Online Safety Assurance for Deep Reinforcement Learning | Oct 7, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Proximal Policy Optimization with Relative Pearson Divergence | Oct 7, 2020 | Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning-Based Dynamic Resource Management for Mobile Edge Computing in Industrial Internet of Things | Oct 6, 2020 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |