| Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem | Apr 22, 2021 | Decision Making, reinforcement-learning | Code Available | 1 |
| LLF-Bench: Benchmark for Interactive Learning from Language Feedback | Dec 11, 2023 | Information Retrieval, OpenAI Gym | Code Available | 1 |
| Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents | Nov 22, 2023 | Decision Making, Language Modeling | Code Available | 1 |
| Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis | Jan 30, 2024 | Decision Making, Efficient Exploration | Code Available | 1 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 |
| Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal Constraints | May 27, 2020 | Decision Making, Decision Making Under Uncertainty | Code Available | 1 |
| Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision Making | Feb 5, 2020 | Decision Making, reinforcement-learning | Code Available | 1 |
| Effective Reinforcement Learning through Evolutionary Surrogate-Assisted Prescription | Feb 13, 2020 | Decision Making, reinforcement-learning | Code Available | 1 |
| Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces | Mar 29, 2024 | Decision Making, Mamba | Code Available | 1 |
| DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback | Oct 8, 2024 | Math, Sequential Decision Making | Code Available | 1 |
| Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling | Oct 29, 2018 | Collaborative Filtering, Decision Making | Code Available | 1 |
| Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step Trees | Jun 29, 2020 | Bayesian Optimization, Decision Making | Code Available | 1 |
| CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Sep 23, 2020 | Decision Making, reinforcement-learning | Code Available | 1 |
| Counterfactual Explanations in Sequential Decision Making Under Uncertainty | Jul 6, 2021 | counterfactual, Counterfactual Explanation | Code Available | 1 |
| Curriculum-based Reinforcement Learning for Distribution System Critical Load Restoration | Mar 8, 2022 | Decision Making, reinforcement-learning | Code Available | 1 |
| AdaPlanner: Adaptive Planning from Feedback with Language Models | May 26, 2023 | Decision Making, Hallucination | Code Available | 1 |
| Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models | Jun 9, 2023 | Decision Making, reinforcement-learning | Code Available | 1 |
| Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply Chains | Apr 20, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning for Entity Alignment | Mar 7, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Jan 6, 2025 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Dynamic Causal Bayesian Optimization | Oct 26, 2021 | Bayesian Optimization, Causal Inference | Code Available | 1 |
| Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Mar 3, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Enabling Intelligent Interactions between an Agent and an LLM: A Reinforcement Learning Approach | Jun 6, 2023 | Decision Making, Sequential Decision Making | Code Available | 1 |
| Extracting Reward Functions from Diffusion Models | Jun 1, 2023 | Decision Making, Image Generation | Code Available | 1 |
| Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI Gym | Dec 6, 2023 | Benchmarking, Decision Making | Code Available | 1 |