| LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents | Feb 13, 2024 | BenchmarkingModel Selection | CodeCode Available | 2 |
| TrustAgent: Towards Safe and Trustworthy LLM-based Agents | Feb 2, 2024 | Task Planning | CodeCode Available | 2 |
| Getting pwn'd by AI: Penetration Testing with Large Language Models | Jul 24, 2023 | EthicsTask Planning | CodeCode Available | 2 |
| SkiROS2: A skill-based Robot Control Platform for ROS | Jun 29, 2023 | SchedulingTask Planning | CodeCode Available | 2 |
| Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents | Feb 3, 2023 | MinecraftTask Planning | CodeCode Available | 2 |
| FlySearch: Exploring how vision-language models explore | Jun 3, 2025 | HallucinationTask Planning | CodeCode Available | 1 |
| BEDI: A Comprehensive Benchmark for Evaluating Embodied Agents on UAVs | May 23, 2025 | Model OptimizationTask Planning | CodeCode Available | 1 |
| CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution | May 21, 2025 | Large Language ModelTask Planning | CodeCode Available | 1 |
| LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Apr 30, 2025 | In-Context LearningObject | CodeCode Available | 1 |
| Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution | Apr 23, 2025 | Task Planning | CodeCode Available | 1 |
| Data-Agnostic Robotic Long-Horizon Manipulation with Vision-Language-Guided Closed-Loop Feedback | Mar 27, 2025 | Task Planning | CodeCode Available | 1 |
| LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language | Mar 21, 2025 | In-Context LearningRobot Task Planning | CodeCode Available | 1 |
| MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration | Feb 25, 2025 | Robot Task PlanningTask Planning | CodeCode Available | 1 |
| Plan-over-Graph: Towards Parallelable LLM Agent Schedule | Feb 20, 2025 | Task Planning | CodeCode Available | 1 |
| Robotouille: An Asynchronous Planning Benchmark for LLM Agents | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Models for Multi-Robot Systems: A Survey | Feb 6, 2025 | Action GenerationBenchmarking | CodeCode Available | 1 |
| VLM-driven Behavior Tree for Context-aware Task Planning | Jan 7, 2025 | Task Planning | CodeCode Available | 1 |
| Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples | Dec 23, 2024 | Common Sense ReasoningTask Planning | CodeCode Available | 1 |
| Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models | Sep 28, 2024 | Drone navigationRobot Manipulation | CodeCode Available | 1 |
| EARBench: Towards Evaluating Physical Risk Awareness for Task Planning of Foundation Model-based Embodied AI Agents | Aug 8, 2024 | Scene GenerationTask Planning | CodeCode Available | 1 |
| Wonderful Team: Zero-Shot Physical Task Planning with Visual LLMs | Jul 26, 2024 | Action GenerationLarge Language Model | CodeCode Available | 1 |
| DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control | Jul 20, 2024 | Instruction FollowingNavigate | CodeCode Available | 1 |
| Can only LLMs do Reasoning?: Potential of Small Language Models in Task Planning | Apr 5, 2024 | Task Planning | CodeCode Available | 1 |