| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement Learning, Off-policy evaluation | —Unverified | 0 | 0 |
| Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error | Jan 28, 2022 | Value prediction | —Unverified | 0 | 0 |
| X-model: Improving Data Efficiency in Deep Learning with A Minimax Model | Oct 9, 2021 | Age Estimation, model | —Unverified | 0 | 0 |
| RCURRENCY: Live Digital Asset Trading Using a Recurrent Neural Network-based Forecasting System | Jun 13, 2021 | Value prediction | —Unverified | 0 | 0 |
| Acquiring Background Knowledge to Improve Moral Value Prediction | Sep 16, 2017 | Prediction, Value prediction | —Unverified | 0 | 0 |
| AIRI: Predicting Retention Indices and their Uncertainties using Artificial Intelligence | Jan 3, 2024 | Value prediction | —Unverified | 0 | 0 |
| AlphaZeroES: Direct score maximization outperforms planning loss minimization | Jun 12, 2024 | Sokoban, Value prediction | —Unverified | 0 | 0 |
| A Meta-learning based Stacked Regression Approach for Customer Lifetime Value Prediction | Aug 7, 2023 | Meta-Learning, regression | —Unverified | 0 | 0 |
| An Optimal Tightness Bound for the Simulation Lemma | Jun 24, 2024 | LEMMA, Value prediction | —Unverified | 0 | 0 |
| A Closer Look at Deep Policy Gradients | Nov 6, 2018 | Value prediction | —Unverified | 0 | 0 |