| Thinker: Learning to Plan and Act | Jul 27, 2023 | Sokoban | Code Available | 1 | 5 |
| Multi-Objective level generator generation with Marahel | May 17, 2020 | Sokoban | Code Available | 1 | 5 |
| Learning to Search with MCTSnets | Feb 13, 2018 | Sokoban | Code Available | 1 | 5 |
| Fast and Precise: Adjusting Planning Horizon with Adaptive Subgoal Search | Jun 1, 2022 | Rubik's Cube, Sokoban | Code Available | 1 | 5 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 | 5 |
| Interpreting learned search: finding a transition model and value function in an RNN that plays Sokoban | Jun 11, 2025 | Sokoban | Code Available | 1 | 5 |
| ExPoSe: Combining State-Based Exploration with Gradient-Based Online Search | Feb 3, 2022 | Atari Games, Decision Making | Code Available | 0 | 5 |
| A Training Data Recipe to Accelerate A* Search with Language Models | Jul 13, 2024 | Heuristic Search, Language Modelling | Code Available | 0 | 5 |
| On Grid Graph Reachability and Puzzle Games | Oct 2, 2023 | Sokoban | Code Available | 0 | 5 |
| Physically Embedded Planning Problems: New Challenges for Reinforcement Learning | Sep 11, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |