| Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming | Jun 22, 2022 | Autonomous Driving, Classification | —Unverified | 0 |
| EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine | Jun 21, 2022 | MuJoCo, reinforcement-learning | Code Available | 5 |
| Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis | Jun 17, 2022 | MuJoCo, Starcraft | —Unverified | 0 |
| Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning | Jun 15, 2022 | Autonomous Driving, continuous-control | —Unverified | 0 |
| Relative Policy-Transition Optimization for Fast Policy Transfer | Jun 13, 2022 | continuous-control, Continuous Control | —Unverified | 0 |
| Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies | Jun 12, 2022 | continuous-control, Continuous Control | —Unverified | 0 |
| Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk | Jun 9, 2022 | continuous-control, Continuous Control | Code Available | 1 |
| Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning | Jun 4, 2022 | MuJoCo, Off-policy evaluation | —Unverified | 0 |
| Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble | Jun 1, 2022 | Imitation Learning, MuJoCo | —Unverified | 0 |
| Multi-Object Grasping in the Plane | Jun 1, 2022 | MuJoCo, Object | —Unverified | 0 |