| Maximizing Ensemble Diversity in Deep Reinforcement Learning | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 |
| Programmatic Reinforcement Learning without Oracles | Sep 29, 2021 | Bilevel OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Task-driven Discovery of Perceptual Schemas for Generalization in Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement LearningObject | —Unverified | 0 |
| MARNET: Backdoor Attacks against Value-Decomposition Multi-Agent Reinforcement Learning | Sep 29, 2021 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| Variance Reduced Domain Randomization for Policy Gradient | Sep 29, 2021 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| A Risk-Sensitive Policy Gradient Method | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Learning Efficient Online 3D Bin Packing on Packing Configuration Trees | Sep 29, 2021 | 3D Bin PackingDeep Reinforcement Learning | CodeCode Available | 2 |
| Generalizing Successor Features to continuous domains for Multi-task Learning | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement LearningOff-policy evaluation | —Unverified | 0 |
| The Remarkable Effectiveness of Combining Policy and Value Networks in A*-based Deep RL for AI Planning | Sep 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |