| EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning | Dec 11, 2023 | BenchmarkingHuman-Object Interaction Detection | CodeCode Available | 1 |
| Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty | Dec 2, 2023 | DenoisingTask Planning | CodeCode Available | 1 |
| Physical Reasoning and Object Planning for Household Embodied Agents | Nov 22, 2023 | 2kDecision Making | CodeCode Available | 1 |
| Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs | Nov 6, 2023 | Imitation LearningIn-Context Learning | CodeCode Available | 1 |
| Vision-Language Interpreter for Robot Task Planning | Nov 2, 2023 | Robot Task PlanningTask Planning | CodeCode Available | 1 |
| New Interaction Paradigm for Complex EDA Software Leveraging GPT | Jul 27, 2023 | Task Planning | CodeCode Available | 1 |
| Embodied Task Planning with Large Language Models | Jul 4, 2023 | Task Planning | CodeCode Available | 1 |
| SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models | May 30, 2023 | BenchmarkingCode Generation | CodeCode Available | 1 |
| Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds | May 27, 2023 | Task PlanningWorld Knowledge | CodeCode Available | 1 |
| A Multi-modal Garden Dataset and Hybrid 3D Dense Reconstruction Framework Based on Panoramic Stereo Images for a Trimming Robot | May 10, 2023 | Task Planning | CodeCode Available | 1 |
| TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs | Jul 11, 2022 | BenchmarkingRepresentation Learning | CodeCode Available | 1 |
| Sequential Manipulation Planning on Scene Graph | Jul 10, 2022 | Object RearrangementStochastic Optimization | CodeCode Available | 1 |
| PlanSys2: A Planning System Framework for ROS2 | Jul 1, 2021 | Task Planning | CodeCode Available | 1 |
| Extended Tree Search for Robot Task and Motion Planning | Mar 9, 2021 | Decision MakingMotion Planning | CodeCode Available | 1 |
| Actionet: An Interactive End-To-End Platform For Task-Based Data Collection And Augmentation In 3D Environment | Oct 3, 2020 | Dataset GenerationTask Planning | CodeCode Available | 1 |
| Learning to combine primitive skills: A step towards versatile robotic manipulation | Aug 2, 2019 | Data AugmentationImitation Learning | CodeCode Available | 1 |
| MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification | Jun 26, 2025 | Image SegmentationLarge Language Model | —Unverified | 0 |
| VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models | Jun 21, 2025 | Action GenerationContinual Learning | —Unverified | 0 |
| Multimodal Fused Learning for Solving the Generalized Traveling Salesman Problem in Robotic Task Planning | Jun 20, 2025 | Computational EfficiencyTask Planning | —Unverified | 0 |
| Towards AI Search Paradigm | Jun 20, 2025 | Decision MakingRetrieval-augmented Generation | —Unverified | 0 |
| Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills | Jun 12, 2025 | Large Language ModelTask Planning | —Unverified | 0 |
| VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning | Jun 10, 2025 | Task PlanningVisual Reasoning | —Unverified | 0 |
| Language-Vision Planner and Executor for Text-to-Visual Reasoning | Jun 9, 2025 | In-Context LearningMME | —Unverified | 0 |
| Prime the search: Using large language models for guiding geometric task and motion planning by warm-starting tree search | Jun 8, 2025 | Common Sense ReasoningMotion Planning | CodeCode Available | 0 |
| RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks | Jun 7, 2025 | Large Language ModelTask Planning | —Unverified | 0 |
| Hierarchical Debate-Based Large Language Model (LLM) for Complex Task Planning of 6G Network Management | Jun 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding Physical Properties of Unseen Deformable Objects by Leveraging Large Language Models and Robot Actions | Jun 4, 2025 | Motion PlanningTask and Motion Planning | —Unverified | 0 |
| Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs | Jun 3, 2025 | ObjectObject Rearrangement | —Unverified | 0 |
| ChemGraph: An Agentic Framework for Computational Chemistry Workflows | Jun 3, 2025 | Computational chemistryGraph Neural Network | —Unverified | 0 |
| Grounded Vision-Language Interpreter for Integrated Task and Motion Planning | Jun 3, 2025 | Motion PlanningTask and Motion Planning | —Unverified | 0 |
| LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks | May 31, 2025 | Task PlanningVision-Language-Action | —Unverified | 0 |
| MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures -- A Comprehensive Framework | May 24, 2025 | Task Planning | —Unverified | 0 |
| Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets | May 21, 2025 | Dataset GenerationDescriptive | —Unverified | 0 |
| Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent | May 20, 2025 | Task Planning | —Unverified | 0 |
| APEX: Empowering LLMs with Physics-Based Task Planning for Real-time Insight | May 20, 2025 | Causal InferenceDecision Making | CodeCode Available | 0 |
| REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning? | May 16, 2025 | Large Language ModelRobot Task Planning | —Unverified | 0 |
| LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution | May 15, 2025 | Robot ManipulationTask Planning | —Unverified | 0 |
| Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM | May 13, 2025 | 16k8k | CodeCode Available | 0 |
| PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents | May 2, 2025 | Instruction FollowingResponse Generation | —Unverified | 0 |
| Leveraging Pre-trained Large Language Models with Refined Prompting for Online Task and Motion Planning | Apr 30, 2025 | Large Language ModelMotion Planning | —Unverified | 0 |
| CoordField: Coordination Field for Agentic UAV Task Allocation In Low-altitude Urban Scenarios | Apr 30, 2025 | Task Planning | —Unverified | 0 |
| NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks | Apr 28, 2025 | Task PlanningVision-Language-Action | —Unverified | 0 |
| Robo-Troj: Attacking LLM-based Task Planners | Apr 23, 2025 | Backdoor AttackDiversity | —Unverified | 0 |
| A Framework for Benchmarking and Aligning Task-Planning Safety in LLM-Based Embodied Agents | Apr 20, 2025 | BenchmarkingTask Planning | —Unverified | 0 |
| InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning | Apr 17, 2025 | Meta-LearningMeta Reinforcement Learning | —Unverified | 0 |
| FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment | Apr 11, 2025 | 3D geometryNatural Language Queries | —Unverified | 0 |
| Personality-Driven Decision-Making in LLM-Based Autonomous Agents | Apr 1, 2025 | Decision MakingScheduling | —Unverified | 0 |
| Visual Environment-Interactive Planning for Embodied Complex-Question Answering | Apr 1, 2025 | Question AnsweringTask Planning | —Unverified | 0 |
| Adaptive Interactive Navigation of Quadruped Robots using Large Language Models | Mar 29, 2025 | Motion PlanningTask Planning | —Unverified | 0 |
| REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot Manipulation | Mar 28, 2025 | Robot ManipulationTask Planning | —Unverified | 0 |