| An Alternative Softmax Operator for Reinforcement Learning | Dec 16, 2016 | Decision Makingreinforcement-learning | CodeCode Available | 1 | 5 |
| An empirical evaluation of active inference in multi-armed bandits | Jan 21, 2021 | BIG-bench Machine LearningDecision Making | CodeCode Available | 1 | 5 |
| Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription | Feb 13, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 | 5 |
| Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions | Oct 15, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 | 5 |
| DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback | Oct 8, 2024 | MathSequential Decision Making | CodeCode Available | 1 | 5 |
| Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach | Jun 6, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 1 | 5 |
| Adaptive Stress Testing of Trajectory Predictions in Flight Management Systems | Nov 4, 2020 | Decision MakingManagement | CodeCode Available | 1 | 5 |
| Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback | Jul 20, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 1 | 5 |
| Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems | Dec 14, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces | Mar 29, 2024 | Decision MakingMamba | CodeCode Available | 1 | 5 |