| Learning by Repetition: Stochastic Multi-armed Bandits under Priming Effect | Jun 18, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework | Jun 17, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| On the Relationship Between Structure in Natural Language and Models of Sequential Decision Processes | Jun 12, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch | Jun 12, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Recurrent Sum-Product-Max Networks for Decision Making in Perfectly-Observed Environments | Jun 12, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Group-Fair Online Allocation in Continuous Time | Jun 11, 2020 | Cloud ComputingDecision Making | —Unverified | 0 |
| Modeling Human Driving Behavior through Generative Adversarial Imitation Learning | Jun 10, 2020 | Decision MakingDisentanglement | —Unverified | 0 |
| When is Particle Filtering Efficient for Planning in Partially Observed Linear Dynamical Systems? | Jun 10, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Stealing Deep Reinforcement Learning Models for Fun and Profit | Jun 9, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Sharp Thresholds of the Information Cascade Fragility Under a Mismatched Model | Jun 7, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| When Does MAML Objective Have Benign Landscape? | May 31, 2020 | Decision MakingMeta-Learning | —Unverified | 0 |
| Reinforcement Learning | May 29, 2020 | Autonomous VehiclesBoard Games | CodeCode Available | 0 |
| Dynamic Bi-Objective Routing of Multiple Vehicles | May 28, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal Constraints | May 27, 2020 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Active Measure Reinforcement Learning for Observation Cost Minimization | May 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| Causal Bayesian Optimization | May 24, 2020 | Bayesian OptimizationCausal Inference | —Unverified | 0 |
| Implementability of Honest Multi-Agent Sequential Decision-Making with Dynamic Population | May 19, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning | May 15, 2020 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Scalable First-Order Methods for Robust MDPs | May 11, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL | May 10, 2020 | Decision MakingLifelong learning | CodeCode Available | 1 |
| Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning | Apr 23, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| iCORPP: Interleaved Commonsense Reasoning and Probabilistic Planning on Robots | Apr 18, 2020 | Decision MakingManagement | —Unverified | 0 |
| Actor-Critic Deep Reinforcement Learning for Solving Job Shop Scheduling Problems | Apr 14, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Sequential Batch Learning in Finite-Action Linear Contextual Bandits | Apr 14, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Distributed Learning: Sequential Decision Making in Resource-Constrained Environments | Apr 13, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |