| Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon | Sep 28, 2020 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Multi-task Causal Learning with Gaussian Processes | Sep 27, 2020 | Active Learning, Bayesian Optimization | Code Available | 1 |
| A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints | Sep 23, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Sep 23, 2020 | Decision Making, reinforcement-learning | Code Available | 1 |
| Transfer Learning in Deep Reinforcement Learning: A Survey | Sep 16, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Causal Bandits without prior knowledge using separating sets | Sep 16, 2020 | Causal Discovery, Decision Making | Unverified | 0 |
| Toward the Fundamental Limits of Imitation Learning | Sep 13, 2020 | Decision Making, Imitation Learning | Unverified | 0 |
| Optimal Inspection and Maintenance Planning for Deteriorating Structural Components through Dynamic Bayesian Networks and Markov Decision Processes | Sep 9, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision Making, Q-Learning | Unverified | 0 |
| Spatial Privacy Pricing: The Interplay between Privacy, Utility and Price in Geo-Marketplaces | Aug 25, 2020 | Decision Making, Sequential Decision Making | Unverified | 0 |