| CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution | May 21, 2025 | Large Language ModelTask Planning | CodeCode Available | 1 |
| Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets | May 21, 2025 | Dataset GenerationDescriptive | —Unverified | 0 |
| Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent | May 20, 2025 | Task Planning | —Unverified | 0 |
| APEX: Empowering LLMs with Physics-Based Task Planning for Real-time Insight | May 20, 2025 | Causal InferenceDecision Making | CodeCode Available | 0 |
| REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning? | May 16, 2025 | Large Language ModelRobot Task Planning | —Unverified | 0 |
| LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution | May 15, 2025 | Robot ManipulationTask Planning | —Unverified | 0 |
| Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM | May 13, 2025 | 16k8k | CodeCode Available | 0 |
| PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents | May 2, 2025 | Instruction FollowingResponse Generation | —Unverified | 0 |
| Leveraging Pre-trained Large Language Models with Refined Prompting for Online Task and Motion Planning | Apr 30, 2025 | Large Language ModelMotion Planning | —Unverified | 0 |
| LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Apr 30, 2025 | In-Context LearningObject | CodeCode Available | 1 |