| AutoDIME: Automatic Design of Interesting Multi-Agent Environments | Mar 4, 2022 | DiagnosticMuJoCo | —Unverified | 0 |
| Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error | Jan 28, 2022 | Value prediction | —Unverified | 0 |
| CoRGi: Content-Rich Graph Neural Networks with Attention | Oct 10, 2021 | ImputationValue prediction | —Unverified | 0 |
| X-model: Improving Data Efficiency in Deep Learning with A Minimax Model | Oct 9, 2021 | Age Estimationmodel | —Unverified | 0 |
| Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble | Oct 4, 2021 | Adroid door-clonedAdroid door-human | CodeCode Available | 1 |
| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement LearningOff-policy evaluation | —Unverified | 0 |
| Understanding and Leveraging Overparameterization in Recursive Value Estimation | Sep 29, 2021 | Reinforcement Learning (RL)Value prediction | —Unverified | 0 |
| On the Estimation Bias in Double Q-Learning | Sep 29, 2021 | Q-LearningValue prediction | CodeCode Available | 0 |
| Generative Self-training for Cross-domain Unsupervised Tagged-to-Cine MRI Synthesis | Jun 23, 2021 | Domain AdaptationImage Generation | —Unverified | 0 |
| RCURRENCY: Live Digital Asset Trading Using a Recurrent Neural Network-based Forecasting System | Jun 13, 2021 | Value prediction | —Unverified | 0 |
| Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface | Jun 8, 2021 | Text GenerationText to SQL | —Unverified | 0 |
| On the Optimality of Batch Policy Optimization Algorithms | Apr 6, 2021 | Value prediction | —Unverified | 0 |
| Learning State Representations from Random Deep Action-conditional Predictions | Feb 9, 2021 | Atari GamesReinforcement Learning (RL) | CodeCode Available | 0 |
| The Value Equivalence Principle for Model-Based Reinforcement Learning | Nov 6, 2020 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Rethinking Deep Policy Gradients via State-Wise Policy Improvement | Oct 19, 2020 | Policy Gradient MethodsValue prediction | —Unverified | 0 |
| DATE: Dual Attentive Tree-aware Embedding for Customs Fraud Detection | Aug 23, 2020 | Fraud DetectionMulti-target regression | CodeCode Available | 1 |
| timeXplain -- A Framework for Explaining the Predictions of Time Series Classifiers | Jul 15, 2020 | Decision MakingExplainable artificial intelligence | CodeCode Available | 0 |
| PIVEN: A Deep Neural Network for Prediction Intervals with Specific Value Prediction | Jun 9, 2020 | PredictionPrediction Intervals | CodeCode Available | 1 |
| The Value-Improvement Path: Towards Better Representations for Reinforcement Learning | Jun 3, 2020 | Atari Gamesreinforcement-learning | —Unverified | 0 |
| Spatial Action Maps for Mobile Manipulation | Apr 20, 2020 | Q-LearningValue prediction | CodeCode Available | 1 |
| Value-driven Hindsight Modelling | Feb 19, 2020 | Atari GamesReinforcement Learning | —Unverified | 0 |
| Using Contextual Information to Improve Blood Glucose Prediction | Aug 24, 2019 | Gaussian ProcessesManagement | —Unverified | 0 |
| Forecasting Wireless Demand with Extreme Values using Feature Embedding in Gaussian Processes | May 15, 2019 | Gaussian ProcessesTraffic Prediction | —Unverified | 0 |
| ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search | Nov 6, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| A Closer Look at Deep Policy Gradients | Nov 6, 2018 | Value prediction | —Unverified | 0 |
| Spatial Correlation and Value Prediction in Convolutional Neural Networks | Jul 21, 2018 | General Classificationimage-classification | —Unverified | 0 |
| Reliability and Sharpness in Border Crossing Traffic Interval Prediction | Nov 13, 2017 | ManagementPrediction | —Unverified | 0 |
| TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning | Oct 31, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Acquiring Background Knowledge to Improve Moral Value Prediction | Sep 16, 2017 | PredictionValue prediction | —Unverified | 0 |
| Multi-task Neural Network for Non-discrete Attribute Prediction in Knowledge Graphs | Aug 16, 2017 | AttributeKnowledge Graphs | —Unverified | 0 |
| Value Prediction Network | Jul 11, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Customer Lifetime Value Prediction Using Embeddings | Mar 7, 2017 | MarketingPrediction | —Unverified | 0 |
| Avoiding Confusion between Predictors and Inhibitors in Value Function Approximation | Dec 19, 2013 | Decision MakingReinforcement Learning | —Unverified | 0 |