Natural Language Planning via Coding and Inference Scaling

2025-05-19Code Available0· sign in to hype

Rikhil Amonkar, Ronan Le Bras, Li Zhang

Code Available — Be the first to reproduce this paper.

Code

github.com/cassiehuang22/llm-as-pddl-formalizer
OfficialIn paperpytorch★ 5

Abstract

Real-life textual planning tasks such as meeting scheduling have posed much challenge to LLMs especially when the complexity is high. While previous work primarily studied auto-regressive generation of plans with closed-source models, we systematically evaluate both closed- and open-source models, including those that scales output length with complexity during inference, in generating programs, which are executed to output the plan. We consider not only standard Python code, but also the code to a constraint satisfaction problem solver. Despite the algorithmic nature of the task, we show that programming often but not always outperforms planning. Our detailed error analysis also indicates a lack of robustness and efficiency in the generated code that hinders generalization.

Tasks

Scheduling

Natural Language Planning via Coding and Inference Scaling

Code

Abstract

Tasks

Reproductions