| A Deep Reinforcement Learning Framework for Continuous Intraday Market Bidding | Apr 13, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations | Apr 1, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding | Mar 12, 2020 | Decision MakingManagement | CodeCode Available | 0 |
| Learning Discrete State Abstractions With Deep Variational Inference | Mar 9, 2020 | Decision MakingMulti-Goal Reinforcement Learning | CodeCode Available | 0 |
| Human AI interaction loop training: New approach for interactive reinforcement learning | Mar 9, 2020 | Decision MakingImitation Learning | —Unverified | 0 |
| A Farewell to Arms: Sequential Reward Maximization on a Budget with a Giving Up Option | Mar 6, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Distributional Robustness and Regularization in Reinforcement Learning | Mar 5, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Exploration-Exploitation in Constrained MDPs | Mar 4, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Mar 3, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Structure-Adaptive Sequential Testing for Online False Discovery Rate Control | Feb 28, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reinforcement Learning of Risk-Constrained Policies in Markov Decision Processes | Feb 27, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Information Directed Sampling for Linear Partial Monitoring | Feb 25, 2020 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Learning Dynamic Belief Graphs to Generalize on Text-Based Games | Feb 21, 2020 | Decision MakingKnowledge Graphs | CodeCode Available | 1 |
| Online Batch Decision-Making with High-Dimensional Covariates | Feb 21, 2020 | Decision MakingMarketing | —Unverified | 0 |
| Weakly-supervised Multi-output Regression via Correlated Gaussian Processes | Feb 19, 2020 | Decision MakingGaussian Processes | —Unverified | 0 |
| Legion: Best-First Concolic Testing | Feb 15, 2020 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| PDDLGym: Gym Environments from PDDL Problems | Feb 15, 2020 | Decision MakingOpenAI Gym | CodeCode Available | 1 |
| Listwise Learning to Rank with Deep Q-Networks | Feb 13, 2020 | Decision MakingLearning-To-Rank | —Unverified | 0 |
| Verifiable RNN-Based Policies for POMDPs Under Temporal Logic Constraints | Feb 13, 2020 | Decision MakingDiagnostic | —Unverified | 0 |
| Tight Lower Bounds for Combinatorial Multi-Armed Bandits | Feb 13, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic | Feb 13, 2020 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription | Feb 13, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning | Feb 7, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making | Feb 5, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations | Feb 4, 2020 | Decision MakingMontezuma's Revenge | —Unverified | 0 |
| Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning | Jan 27, 2020 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Fairness in Learning-Based Sequential Decision Algorithms: A Survey | Jan 14, 2020 | Decision MakingFairness | —Unverified | 0 |
| Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings | Jan 13, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| A storage expansion planning framework using reinforcement learning and simulation-based optimization | Jan 10, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| On Computation and Generalization of Generative Adversarial Imitation Learning | Jan 9, 2020 | Decision MakingImitation Learning | —Unverified | 0 |
| Direct and indirect reinforcement learning | Dec 23, 2019 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances | Dec 9, 2019 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaR | Dec 3, 2019 | Decision MakingReinforcement Learning | CodeCode Available | 0 |
| Maximum Entropy Monte-Carlo Planning | Dec 1, 2019 | Atari GamesDecision Making | —Unverified | 0 |
| SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies | Dec 1, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks | Nov 26, 2019 | Decision MakingGPU | —Unverified | 0 |
| Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms | Nov 24, 2019 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Planning with Goal-Conditioned Policies | Nov 19, 2019 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Working Memory Graphs | Nov 17, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| One-shot learning and behavioral eligibility traces in sequential decision making | Nov 12, 2019 | Decision MakingLearning Theory | —Unverified | 0 |
| A Biologically Plausible Benchmark for Contextual Bandit Algorithms in Precision Oncology Using in vitro Data | Nov 11, 2019 | BenchmarkingDecision Making | CodeCode Available | 0 |
| Adaptivity in Adaptive Submodularity | Nov 9, 2019 | Active LearningDecision Making | —Unverified | 0 |
| Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | Nov 2, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints | Nov 2, 2019 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Thompson Sampling via Local Uncertainty | Oct 30, 2019 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Policy Learning for Malaria Control | Oct 20, 2019 | Bayesian OptimizationDecision Making | CodeCode Available | 0 |
| Adaptive Exploration in Linear Contextual Bandit | Oct 15, 2019 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions | Oct 15, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| MABWiser: A Parallelizable Contextual Multi-Armed Bandit Library for Python | Oct 4, 2019 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Deep Q-Network for Angry Birds | Oct 4, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |