| Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning | Sep 6, 2021 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Hierarchical Object-to-Zone Graph for Object Navigation | Sep 5, 2021 | Deep Reinforcement Learning, Object | Code Available | 1 |
| Soft Hierarchical Graph Recurrent Networks for Many-Agent Partially Observable Environments | Sep 5, 2021 | Deep Reinforcement Learning, Graph Attention | Unverified | 0 |
| An Exploration of Deep Learning Methods in Hungry Geese | Sep 5, 2021 | Deep Learning, Deep Reinforcement Learning | Code Available | 0 |
| Temporal Shift Reinforcement Learning | Sep 5, 2021 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Reinforcement Learning for Battery Energy Storage Dispatch augmented with Model-based Optimizer | Sep 2, 2021 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement Learning, General Reinforcement Learning | Code Available | 0 |
| Learning Practically Feasible Policies for Online 3D Bin Packing | Aug 31, 2021 | 3D Bin Packing, Collision Avoidance | Code Available | 2 |
| WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU | Aug 31, 2021 | CPU, Decision Making | Code Available | 1 |
| Learning to Synthesize Programs as Interpretable and Generalizable Policies | Aug 31, 2021 | Deep Reinforcement Learning, Program Synthesis | Code Available | 1 |