| AdaPlanner: Adaptive Planning from Feedback with Language Models | May 26, 2023 | Decision MakingHallucination | CodeCode Available | 1 |
| Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal Constraints | May 27, 2020 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription | Feb 13, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Extracting Reward Functions from Diffusion Models | Jun 1, 2023 | Decision MakingImage Generation | CodeCode Available | 1 |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision MakingOffline RL | CodeCode Available | 1 |
| Hybrid Multi-agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems | Dec 14, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem | Apr 22, 2021 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym | Dec 6, 2023 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents | Nov 22, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis | Jan 30, 2024 | Decision MakingEfficient Exploration | CodeCode Available | 1 |
| An Alternative Softmax Operator for Reinforcement Learning | Dec 16, 2016 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions | Oct 15, 2019 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 |
| RELIEF: Reinforcement Learning Empowered Graph Feature Prompt Tuning | Aug 6, 2024 | Combinatorial OptimizationGraph Neural Network | CodeCode Available | 1 |
| Markup-to-Image Diffusion Models with Scheduled Sampling | Oct 11, 2022 | Decision MakingDenoising | CodeCode Available | 1 |
| Masked Trajectory Models for Prediction, Representation, and Control | May 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback | Jul 20, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Sep 23, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| An empirical evaluation of active inference in multi-armed bandits | Jan 21, 2021 | BIG-bench Machine LearningDecision Making | CodeCode Available | 1 |
| Multi-task Causal Learning with Gaussian Processes | Sep 27, 2020 | Active LearningBayesian Optimization | CodeCode Available | 1 |
| Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models | Jun 9, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 1 |
| PDDLGym: Gym Environments from PDDL Problems | Feb 15, 2020 | Decision MakingOpenAI Gym | CodeCode Available | 1 |
| PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control | Feb 16, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Pursuing Overall Welfare in Federated Learning through Sequential Decision Making | May 31, 2024 | Decision MakingFairness | CodeCode Available | 1 |
| Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer | Mar 12, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 1 |
| A Generative Machine Learning Approach to Policy Optimization in Pursuit-Evasion Games | Oct 4, 2020 | BIG-bench Machine LearningDecision Making | —Unverified | 0 |