| Billion-user Customer Lifetime Value Prediction: An Industrial-scale Solution from Kuaishou | Aug 29, 2022 | Value prediction | —Unverified | 0 |
| Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning | Jun 25, 2022 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Associative Learning Mechanism for Drug-Target Interaction Prediction | May 24, 2022 | molecular representationPrediction | —Unverified | 0 |
| Region of Interest focused MRI to Synthetic CT Translation using Regression and Classification Multi-task Network | Mar 30, 2022 | regressionValue prediction | —Unverified | 0 |
| AutoDIME: Automatic Design of Interesting Multi-Agent Environments | Mar 4, 2022 | DiagnosticMuJoCo | —Unverified | 0 |
| Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error | Jan 28, 2022 | Value prediction | —Unverified | 0 |
| CoRGi: Content-Rich Graph Neural Networks with Attention | Oct 10, 2021 | ImputationValue prediction | —Unverified | 0 |
| X-model: Improving Data Efficiency in Deep Learning with A Minimax Model | Oct 9, 2021 | Age Estimationmodel | —Unverified | 0 |
| On the Estimation Bias in Double Q-Learning | Sep 29, 2021 | Q-LearningValue prediction | CodeCode Available | 0 |
| Understanding and Leveraging Overparameterization in Recursive Value Estimation | Sep 29, 2021 | Reinforcement Learning (RL)Value prediction | —Unverified | 0 |
| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement LearningOff-policy evaluation | —Unverified | 0 |
| Generative Self-training for Cross-domain Unsupervised Tagged-to-Cine MRI Synthesis | Jun 23, 2021 | Domain AdaptationImage Generation | —Unverified | 0 |
| RCURRENCY: Live Digital Asset Trading Using a Recurrent Neural Network-based Forecasting System | Jun 13, 2021 | Value prediction | —Unverified | 0 |
| Turing: an Accurate and Interpretable Multi-Hypothesis Cross-Domain Natural Language Database Interface | Jun 8, 2021 | Text GenerationText to SQL | —Unverified | 0 |
| On the Optimality of Batch Policy Optimization Algorithms | Apr 6, 2021 | Value prediction | —Unverified | 0 |
| Learning State Representations from Random Deep Action-conditional Predictions | Feb 9, 2021 | Atari GamesReinforcement Learning (RL) | CodeCode Available | 0 |
| The Value Equivalence Principle for Model-Based Reinforcement Learning | Nov 6, 2020 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Rethinking Deep Policy Gradients via State-Wise Policy Improvement | Oct 19, 2020 | Policy Gradient MethodsValue prediction | —Unverified | 0 |
| timeXplain -- A Framework for Explaining the Predictions of Time Series Classifiers | Jul 15, 2020 | Decision MakingExplainable artificial intelligence | CodeCode Available | 0 |
| The Value-Improvement Path: Towards Better Representations for Reinforcement Learning | Jun 3, 2020 | Atari Gamesreinforcement-learning | —Unverified | 0 |
| Value-driven Hindsight Modelling | Feb 19, 2020 | Atari GamesReinforcement Learning | —Unverified | 0 |
| Using Contextual Information to Improve Blood Glucose Prediction | Aug 24, 2019 | Gaussian ProcessesManagement | —Unverified | 0 |
| Forecasting Wireless Demand with Extreme Values using Feature Embedding in Gaussian Processes | May 15, 2019 | Gaussian ProcessesTraffic Prediction | —Unverified | 0 |
| ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search | Nov 6, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| A Closer Look at Deep Policy Gradients | Nov 6, 2018 | Value prediction | —Unverified | 0 |
| Spatial Correlation and Value Prediction in Convolutional Neural Networks | Jul 21, 2018 | General Classificationimage-classification | —Unverified | 0 |
| Reliability and Sharpness in Border Crossing Traffic Interval Prediction | Nov 13, 2017 | ManagementPrediction | —Unverified | 0 |
| TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning | Oct 31, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Acquiring Background Knowledge to Improve Moral Value Prediction | Sep 16, 2017 | PredictionValue prediction | —Unverified | 0 |
| Multi-task Neural Network for Non-discrete Attribute Prediction in Knowledge Graphs | Aug 16, 2017 | AttributeKnowledge Graphs | —Unverified | 0 |
| Value Prediction Network | Jul 11, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Customer Lifetime Value Prediction Using Embeddings | Mar 7, 2017 | MarketingPrediction | —Unverified | 0 |
| Avoiding Confusion between Predictors and Inhibitors in Value Function Approximation | Dec 19, 2013 | Decision MakingReinforcement Learning | —Unverified | 0 |