| Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies | Nov 29, 2020 | Off-policy evaluation, Recommendation Systems | Unverified | 0 |
| CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee | Nov 11, 2020 | reinforcement-learning, Reinforcement Learning (RL) | Unverified | 0 |
| Recovery RL: Safe Reinforcement Learning with Learned Recovery Zones | Oct 29, 2020 | Contact-rich Manipulation, reinforcement-learning | Code Available | 1 |
| Constrained Model-based Reinforcement Learning with Robust Cross-Entropy Method | Oct 15, 2020 | Model-based Reinforcement Learning, Model Predictive Control | Code Available | 1 |
| Remote Electrical Tilt Optimization via Safe Reinforcement Learning | Oct 12, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| Safe Reinforcement Learning with Natural Language Constraints | Oct 11, 2020 | Autonomous Navigation, reinforcement-learning | Unverified | 0 |
| A Primal Approach to Constrained Policy Optimization: Global Optimality and Finite-Time Analysis | Sep 28, 2020 | Safe Reinforcement Learning | Unverified | 0 |
| Safe Reinforcement Learning in Constrained Markov Decision Processes | Aug 15, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 1 |
| Safe Model-Based Reinforcement Learning for Systems with Parametric Uncertainties | Jul 24, 2020 | Model-based Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Responsive Safety in Reinforcement Learning by PID Lagrangian Methods | Jul 8, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |