| Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning | Sep 12, 2024 | Decision MakingManagement | —Unverified | 0 | 0 |
| Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies | Nov 29, 2020 | Off-policy evaluationRecommendation Systems | —Unverified | 0 | 0 |
| Optimal Transport-Assisted Risk-Sensitive Q-Learning | Jun 17, 2024 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Parenting: Safe Reinforcement Learning from Human Input | Feb 18, 2019 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Penalized Proximal Policy Optimization for Safe Reinforcement Learning | May 24, 2022 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Physics-model-guided Worst-case Sampling for Safe Reinforcement Learning | Dec 17, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |