| Legion: Best-First Concolic Testing | Feb 15, 2020 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Verifiable RNN-Based Policies for POMDPs Under Temporal Logic Constraints | Feb 13, 2020 | Decision Making, Diagnostic | Unverified | 0 |
| Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic | Feb 13, 2020 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| Tight Lower Bounds for Combinatorial Multi-Armed Bandits | Feb 13, 2020 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Listwise Learning to Rank with Deep Q-Networks | Feb 13, 2020 | Decision Making, Learning-To-Rank | Unverified | 0 |
| Accelerating Reinforcement Learning for Reaching using Continuous Curriculum Learning | Feb 7, 2020 | Decision Making, reinforcement-learning | Unverified | 0 |
| Bridging the Gap: Providing Post-Hoc Symbolic Explanations for Sequential Decision-Making Problems with Inscrutable Representations | Feb 4, 2020 | Decision Making, Montezuma's Revenge | Unverified | 0 |
| Computing the Feedback Capacity of Finite State Channels using Reinforcement Learning | Jan 27, 2020 | Computational Efficiency, Decision Making | Code Available | 0 |
| Fairness in Learning-Based Sequential Decision Algorithms: A Survey | Jan 14, 2020 | Decision Making, Fairness | Unverified | 0 |
| Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings | Jan 13, 2020 | Decision Making, reinforcement-learning | Code Available | 0 |
| A storage expansion planning framework using reinforcement learning and simulation-based optimization | Jan 10, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| On Computation and Generalization of Generative Adversarial Imitation Learning | Jan 9, 2020 | Decision Making, Imitation Learning | Unverified | 0 |
| Direct and indirect reinforcement learning | Dec 23, 2019 | Decision Making, reinforcement-learning | Unverified | 0 |
| Decentralized Multi-Agent Reinforcement Learning with Networked Agents: Recent Advances | Dec 9, 2019 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| Risk-Averse Action Selection Using Extreme Value Theory Estimates of the CVaR | Dec 3, 2019 | Decision Making, Reinforcement Learning | Code Available | 0 |
| Maximum Entropy Monte-Carlo Planning | Dec 1, 2019 | Atari Games, Decision Making | Unverified | 0 |
| SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies | Dec 1, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| An Optimized and Energy-Efficient Parallel Implementation of Non-Iteratively Trained Recurrent Neural Networks | Nov 26, 2019 | Decision Making, GPU | Unverified | 0 |
| Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms | Nov 24, 2019 | Autonomous Driving, Decision Making | Unverified | 0 |
| Planning with Goal-Conditioned Policies | Nov 19, 2019 | Decision Making, reinforcement-learning | Code Available | 0 |
| Working Memory Graphs | Nov 17, 2019 | Decision Making, Sequential Decision Making | Unverified | 0 |
| One-shot learning and behavioral eligibility traces in sequential decision making | Nov 12, 2019 | Decision Making, Learning Theory | Unverified | 0 |
| A Biologically Plausible Benchmark for Contextual Bandit Algorithms in Precision Oncology Using in vitro Data | Nov 11, 2019 | Benchmarking, Decision Making | Code Available | 0 |
| Adaptivity in Adaptive Submodularity | Nov 9, 2019 | Active Learning, Decision Making | Unverified | 0 |
| Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | Nov 2, 2019 | Decision Making, Sequential Decision Making | Unverified | 0 |