| The Choice Function Framework for Online Policy Improvement | Oct 1, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems | Oct 1, 2019 | Decision MakingManagement | —Unverified | 0 |
| PROVABLY BENEFITS OF DEEP HIERARCHICAL RL | Sep 25, 2019 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning | Sep 25, 2019 | Decision MakingKnowledge Distillation | —Unverified | 0 |
| Learning Functionally Decomposed Hierarchies for Continuous Navigation Tasks | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Generalizing Reinforcement Learning to Unseen Actions | Sep 25, 2019 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Selective Network Discovery via Deep Reinforcement Learning on Embedded Spaces | Sep 16, 2019 | AttributeDecision Making | —Unverified | 0 |
| Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees | Sep 11, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Back to the Future -- Sequential Alignment of Text Representations | Sep 8, 2019 | Decision MakingRumour Detection | CodeCode Available | 0 |
| Classification with Costly Features as a Sequential Decision-Making Problem | Sep 5, 2019 | ClassificationClassification with Costly Features | CodeCode Available | 0 |
| An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits | Sep 5, 2019 | Decision MakingRecommendation Systems | —Unverified | 0 |
| Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control | Sep 4, 2019 | Decision MakingOpen-Ended Question Answering | CodeCode Available | 0 |
| Can A User Anticipate What Her Followers Want? | Sep 1, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Interactive Machine Comprehension with Information Seeking Agents | Aug 27, 2019 | Decision MakingInformation Retrieval | CodeCode Available | 0 |
| Reinforcement Learning in Healthcare: A Survey | Aug 22, 2019 | Decision MakingMedical Diagnosis | —Unverified | 0 |
| Exploring Offline Policy Evaluation for the Continuous-Armed Bandit Problem | Aug 21, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Online Planning for Decentralized Stochastic Control with Partial History Sharing | Aug 6, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Bridging Commonsense Reasoning and Probabilistic Planning via a Probabilistic Action Language | Jul 31, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation | Jul 30, 2019 | Decision MakingLearning-To-Rank | CodeCode Available | 0 |
| Bandit Convex Optimization in Non-stationary Environments | Jul 29, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Scaling Multi-Armed Bandit Algorithms | Jul 25, 2019 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL | Jul 24, 2019 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| A Sufficient Statistic for Influence in Structured Multiagent Environments | Jul 22, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reward Advancement: Transforming Policy under Maximum Causal Entropy Principle | Jul 11, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Scheme for Dynamic Risk-Sensitive Sequential Decision Making | Jul 9, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |