| Maximum Entropy Monte-Carlo Planning | Dec 1, 2019 | Atari GamesDecision Making | —Unverified | 0 |
| SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies | Dec 1, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks | Nov 26, 2019 | Decision MakingGPU | —Unverified | 0 |
| Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms | Nov 24, 2019 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Planning with Goal-Conditioned Policies | Nov 19, 2019 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Working Memory Graphs | Nov 17, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| One-shot learning and behavioral eligibility traces in sequential decision making | Nov 12, 2019 | Decision MakingLearning Theory | —Unverified | 0 |
| A Biologically Plausible Benchmark for Contextual Bandit Algorithms in Precision Oncology Using in vitro Data | Nov 11, 2019 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Adaptivity in Adaptive Submodularity | Nov 9, 2019 | Active LearningDecision Making | —Unverified | 0 |
| Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | Nov 2, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |