| Comparing Deep Reinforcement Learning Algorithms in Two-Echelon Supply Chains | Apr 20, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models | Jun 9, 2023 | Decision Makingreinforcement-learning | CodeCode Available | 1 | 5 |
| DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback | Oct 8, 2024 | MathSequential Decision Making | CodeCode Available | 1 | 5 |
| Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces | Mar 29, 2024 | Decision MakingMamba | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning for Entity Alignment | Mar 7, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit Tasks | May 15, 2025 | Decision MakingDecision Making Under Uncertainty | CodeCode Available | 1 | 5 |
| AdaPlanner: Adaptive Planning from Feedback with Language Models | May 26, 2023 | Decision MakingHallucination | CodeCode Available | 1 | 5 |
| ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation | Jul 6, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Sep 23, 2020 | Decision Makingreinforcement-learning | CodeCode Available | 1 | 5 |
| Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Jan 6, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |