| Circuit Routing Using Monte Carlo Tree Search and Deep Neural Networks | Jun 24, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Risk-Sensitive Reinforcement Learning: a Martingale Approach to Reward Uncertainty | Jun 23, 2020 | Decision MakingPortfolio Optimization | —Unverified | 0 |
| Towards Tractable Optimism in Model-Based Reinforcement Learning | Jun 21, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Counterfactually Guided Off-policy Transfer in Clinical Settings | Jun 20, 2020 | counterfactualDecision Making | —Unverified | 0 |
| Frequentist Uncertainty in Recurrent Neural Networks via Blockwise Influence Functions | Jun 20, 2020 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect | Jun 18, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework | Jun 17, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| On the Relationship Between Structure in Natural Language and Models of Sequential Decision Processes | Jun 12, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch | Jun 12, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed Environments | Jun 12, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 |