| On Reward Structures of Markov Decision Processes | Aug 28, 2023 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 | 0 |
| On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions | Feb 10, 2021 | Anomaly Detection, Safe Reinforcement Learning | —Unverified | 0 | 0 |
| Optimal Management of Grid-Interactive Efficient Buildings via Safe Reinforcement Learning | Sep 12, 2024 | Decision Making, Management | —Unverified | 0 | 0 |
| Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies | Nov 29, 2020 | Off-policy evaluation, Recommendation Systems | —Unverified | 0 | 0 |
| Optimal Transport-Assisted Risk-Sensitive Q-Learning | Jun 17, 2024 | Decision Making, Q-Learning | —Unverified | 0 | 0 |
| Parenting: Safe Reinforcement Learning from Human Input | Feb 18, 2019 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 | 0 |