| The Value Equivalence Principle for Model-Based Reinforcement Learning | Nov 6, 2020 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| The Value-Improvement Path: Towards Better Representations for Reinforcement Learning | Jun 3, 2020 | Atari Gamesreinforcement-learning | —Unverified | 0 | 0 |
| Towards a Better Understanding of Representation Dynamics under TD-learning | May 29, 2023 | Reinforcement Learning (RL)Representation Learning | —Unverified | 0 | 0 |
| Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface | Jun 8, 2021 | Text GenerationText to SQL | —Unverified | 0 | 0 |
| Understanding and Leveraging Overparameterization in Recursive Value Estimation | Sep 29, 2021 | Reinforcement Learning (RL)Value prediction | —Unverified | 0 | 0 |
| Using Contextual Information to Improve Blood Glucose Prediction | Aug 24, 2019 | Gaussian ProcessesManagement | —Unverified | 0 | 0 |
| Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning | Jun 25, 2022 | Contrastive LearningData Augmentation | —Unverified | 0 | 0 |
| Value-driven Hindsight Modelling | Feb 19, 2020 | Atari GamesReinforcement Learning | —Unverified | 0 | 0 |
| Value Prediction for Spatiotemporal Gait Data Using Deep Learning | Feb 29, 2024 | Deep LearningPrediction | —Unverified | 0 | 0 |
| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement LearningOff-policy evaluation | —Unverified | 0 | 0 |
| Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error | Jan 28, 2022 | Value prediction | —Unverified | 0 | 0 |
| X-model: Improving Data Efficiency in Deep Learning with A Minimax Model | Oct 9, 2021 | Age Estimationmodel | —Unverified | 0 | 0 |
| RCURRENCY: Live Digital Asset Trading Using a Recurrent Neural Network-based Forecasting System | Jun 13, 2021 | Value prediction | —Unverified | 0 | 0 |
| Acquiring Background Knowledge to Improve Moral Value Prediction | Sep 16, 2017 | PredictionValue prediction | —Unverified | 0 | 0 |
| AIRI: Predicting Retention Indices and their Uncertainties using Artificial Intelligence | Jan 3, 2024 | Value prediction | —Unverified | 0 | 0 |
| AlphaZeroES: Direct score maximization outperforms planning loss minimization | Jun 12, 2024 | SokobanValue prediction | —Unverified | 0 | 0 |
| A Meta-learning based Stacked Regression Approach for Customer Lifetime Value Prediction | Aug 7, 2023 | Meta-Learningregression | —Unverified | 0 | 0 |
| An Optimal Tightness Bound for the Simulation Lemma | Jun 24, 2024 | LEMMAValue prediction | —Unverified | 0 | 0 |
| A Closer Look at Deep Policy Gradients | Nov 6, 2018 | Value prediction | —Unverified | 0 | 0 |
| Associative Learning Mechanism for Drug-Target Interaction Prediction | May 24, 2022 | molecular representationPrediction | —Unverified | 0 | 0 |
| Attentive Continuous Generative Self-training for Unsupervised Domain Adaptive Medical Image Translation | May 23, 2023 | Domain AdaptationPseudo Label | —Unverified | 0 | 0 |
| Auction Design using Value Prediction with Hallucinations | Feb 12, 2025 | PredictionValue prediction | —Unverified | 0 | 0 |
| AutoDIME: Automatic Design of Interesting Multi-Agent Environments | Mar 4, 2022 | DiagnosticMuJoCo | —Unverified | 0 | 0 |
| Avoiding Confusion between Predictors and Inhibitors in Value Function Approximation | Dec 19, 2013 | Decision MakingReinforcement Learning | —Unverified | 0 | 0 |
| Billion-user Customer Lifetime Value Prediction: An Industrial-scale Solution from Kuaishou | Aug 29, 2022 | Value prediction | —Unverified | 0 | 0 |