| Solving Robust Markov Decision Processes: Generic, Reliable, Efficient | Dec 13, 2024 | Decision Making, Sequential Decision Making | Unverified | 0 |
| How Should We Represent History in Interpretable Models of Clinical Policies? | Dec 10, 2024 | Decision Making, Representation Learning | Code Available | 0 |
| Effective Reward Specification in Deep Reinforcement Learning | Dec 10, 2024 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Swarm Behavior Cloning | Dec 10, 2024 | Decision Making, Imitation Learning | Unverified | 0 |
| Optimizing Sensor Redundancy in Sequential Decision-Making Problems | Dec 10, 2024 | Decision Making, OpenAI Gym | Unverified | 0 |
| Discrete-Time Distribution Steering using Monte Carlo Tree Search | Dec 9, 2024 | Decision Making, Sequential Decision Making | Code Available | 0 |
| A Note on Sample Complexity of Interactive Imitation Learning with Log Loss | Dec 9, 2024 | Decision Making, Imitation Learning | Unverified | 0 |
| Conservative Contextual Bandits: Beyond Linear Representations | Dec 9, 2024 | Multi-Armed Bandits, Sequential Decision Making | Unverified | 0 |
| Reinforcement Learning: An Overview | Dec 6, 2024 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Nonmyopic Global Optimisation via Approximate Dynamic Programming | Dec 6, 2024 | Bayesian Optimisation, Gaussian Processes | Code Available | 0 |