| Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping | Feb 21, 2024 | Decision Making, Decoder | Code Available | 3 | 5 |
| Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban | Jun 11, 2025 | Sokoban | Code Available | 1 | 5 |
| Subgoal Search For Complex Reasoning Tasks | Aug 25, 2021 | Diversity, Rubik's Cube | Code Available | 1 | 5 |
| Multi-Objective level generator generation with Marahel | May 17, 2020 | Sokoban | Code Available | 1 | 5 |
| Policy-Guided Heuristic Search with Guarantees | Mar 21, 2021 | Heuristic Search, Sokoban | Code Available | 1 | 5 |
| Illuminating Diverse Neural Cellular Automata for Level Generation | Sep 12, 2021 | Diversity, Sokoban | Code Available | 1 | 5 |
| Tree Search vs Optimization Approaches for Map Generation | Mar 27, 2019 | Evolutionary Algorithms, global-optimization | Code Available | 1 | 5 |
| Thinker: Learning to Plan and Act | Jul 27, 2023 | Sokoban | Code Available | 1 | 5 |
| Planning in a recurrent neural network that plays Sokoban | Jul 22, 2024 | Sokoban | Code Available | 1 | 5 |
| Classical Planning in Deep Latent Space | Jun 30, 2021 | Deep Learning, Sokoban | Code Available | 1 | 5 |
| MageBench: Bridging Large Multimodal Models to Agents | Dec 5, 2024 | Sokoban | Code Available | 1 | 5 |
| Solving Hard AI Planning Instances Using Curriculum-Driven Deep Reinforcement Learning | Jun 4, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Levin Tree Search with Context Models | May 26, 2023 | Rubik's Cube, Sokoban | Code Available | 1 | 5 |
| Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search | Jun 1, 2022 | Rubik's Cube, Sokoban | Code Available | 1 | 5 |
| Learning to Search with MCTSnets | Feb 13, 2018 | Sokoban | Code Available | 1 | 5 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 | 5 |
| ExPoSe: Combining State-Based Exploration with Gradient-Based Online Search | Feb 3, 2022 | Atari Games, Decision Making | Code Available | 0 | 5 |
| A Training Data Recipe to Accelerate A* Search with Language Models | Jul 13, 2024 | Heuristic Search, Language Modelling | Code Available | 0 | 5 |
| Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks | Oct 6, 2018 | All, Montezuma's Revenge | Code Available | 0 | 5 |
| Single-Agent Policy Tree Search With Guarantees | Nov 27, 2018 | Heuristic Search, Sokoban | Code Available | 0 | 5 |
| Mixed-Initiative Level Design with RL Brush | Aug 6, 2020 | reinforcement-learning, Reinforcement Learning (RL) | Code Available | 0 | 5 |
| An investigation of model-free planning | Jan 11, 2019 | Inductive Bias, model | Code Available | 0 | 5 |
| On Grid Graph Reachability and Puzzle Games | Oct 2, 2023 | Sokoban | Code Available | 0 | 5 |
| Inductive general game playing | Jun 23, 2019 | Inductive logic programming, Sokoban | Code Available | 0 | 5 |
| 3D Building Generation in Minecraft via Large Language Models | Jun 13, 2024 | Minecraft, Sokoban | Code Available | 0 | 5 |