| A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding | Apr 13, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Apr 1, 2020 | continuous-control, Continuous Control | Code Available | 0 |
| Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding | Mar 12, 2020 | Decision Making, Management | Code Available | 0 |
| Learning Discrete State Abstractions With Deep Variational Inference | Mar 9, 2020 | Decision Making, Multi-Goal Reinforcement Learning | Code Available | 0 |
| Human AI interaction loop training: New approach for interactive reinforcement learning | Mar 9, 2020 | Decision Making, Imitation Learning | Unverified | 0 |
| A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option | Mar 6, 2020 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Distributional Robustness and Regularization in Reinforcement Learning | Mar 5, 2020 | Decision Making, reinforcement-learning | Unverified | 0 |
| Exploration-Exploitation in Constrained MDPs | Mar 4, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Mar 3, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Structure-Adaptive Sequential Testing for Online False Discovery Rate Control | Feb 28, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes | Feb 27, 2020 | Decision Making, reinforcement-learning | Code Available | 0 |
| Information Directed Sampling for Linear Partial Monitoring | Feb 25, 2020 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Learning Dynamic Belief Graphs to Generalize on Text-Based Games | Feb 21, 2020 | Decision Making, Knowledge Graphs | Code Available | 1 |
| Online Batch Decision-Making with High-Dimensional Covariates | Feb 21, 2020 | Decision Making, Marketing | Unverified | 0 |
| Weakly-supervised Multi-output Regression via Correlated Gaussian Processes | Feb 19, 2020 | Decision Making, Gaussian Processes | Unverified | 0 |
| Legion: Best-First Concolic Testing | Feb 15, 2020 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| PDDLGym: Gym Environments from PDDL Problems | Feb 15, 2020 | Decision Making, OpenAI Gym | Code Available | 1 |
| Listwise Learning to Rank with Deep Q-Networks | Feb 13, 2020 | Decision Making, Learning-To-Rank | Unverified | 0 |
| Verifiable RNN-Based Policies for POMDPs Under Temporal Logic Constraints | Feb 13, 2020 | Decision Making, Diagnostic | Unverified | 0 |
| Tight Lower Bounds for Combinatorial Multi-Armed Bandits | Feb 13, 2020 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic | Feb 13, 2020 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription | Feb 13, 2020 | Decision Making, reinforcement-learning | Code Available | 1 |
| Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning | Feb 7, 2020 | Decision Making, reinforcement-learning | Unverified | 0 |
| Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making | Feb 5, 2020 | Decision Making, reinforcement-learning | Code Available | 1 |
| Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations | Feb 4, 2020 | Decision Making, Montezuma's Revenge | Unverified | 0 |