| Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping | Feb 21, 2024 | Decision Making, Decoder | Code Available | 3 |
| Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban | Jun 11, 2025 | Sokoban | Code Available | 1 |
| MageBench: Bridging Large Multimodal Models to Agents | Dec 5, 2024 | Sokoban | Code Available | 1 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 |
| Planning in a recurrent neural network that plays Sokoban | Jul 22, 2024 | Sokoban | Code Available | 1 |
| Thinker: Learning to Plan and Act | Jul 27, 2023 | Sokoban | Code Available | 1 |
| Levin Tree Search with Context Models | May 26, 2023 | Rubik's Cube, Sokoban | Code Available | 1 |
| Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search | Jun 1, 2022 | Rubik's Cube, Sokoban | Code Available | 1 |
| Illuminating Diverse Neural Cellular Automata for Level Generation | Sep 12, 2021 | Diversity, Sokoban | Code Available | 1 |
| Subgoal Search For Complex Reasoning Tasks | Aug 25, 2021 | Diversity, Rubik's Cube | Code Available | 1 |
| Classical Planning in Deep Latent Space | Jun 30, 2021 | Deep Learning, Sokoban | Code Available | 1 |
| Policy-Guided Heuristic Search with Guarantees | Mar 21, 2021 | Heuristic Search, Sokoban | Code Available | 1 |
| Solving Hard AI Planning Instances Using Curriculum-Driven Deep Reinforcement Learning | Jun 4, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Multi-Objective level generator generation with Marahel | May 17, 2020 | Sokoban | Code Available | 1 |
| Tree Search vs Optimization Approaches for Map Generation | Mar 27, 2019 | Evolutionary Algorithms, global-optimization | Code Available | 1 |
| Learning to Search with MCTSnets | Feb 13, 2018 | Sokoban | Code Available | 1 |
| Solving Sokoban using Hierarchical Reinforcement Learning with Landmarks | Apr 6, 2025 | Hierarchical Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Interpreting Emergent Planning in Model-Free Reinforcement Learning | Apr 2, 2025 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach | Sep 18, 2024 | Decision Making, Human Agent Collaboration | Unverified | 0 |
| A Training Data Recipe to Accelerate A* Search with Language Models | Jul 13, 2024 | Heuristic Search, Language Modelling | Code Available | 0 |
| Autoverse: An Evolvable Game Language for Learning Robust Embodied Agents | Jul 5, 2024 | GPU, Imitation Learning | Unverified | 0 |
| Interpreting Multi-objective Evolutionary Algorithms via Sokoban Level Generation | Jun 15, 2024 | Diversity, Evolutionary Algorithms | Unverified | 0 |
| 3D Building Generation in Minecraft via Large Language Models | Jun 13, 2024 | Minecraft, Sokoban | Code Available | 0 |
| AlphaZeroES: Direct score maximization outperforms planning loss minimization | Jun 12, 2024 | Sokoban, Value prediction | Unverified | 0 |
| Human Goal Recognition as Bayesian Inference: Investigating the Impact of Actions, Timing, and Goal Solvability | Feb 16, 2024 | Bayesian Inference, Sokoban | Unverified | 0 |