| EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning | Dec 11, 2023 | BenchmarkingHuman-Object Interaction Detection | CodeCode Available | 1 |
| Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty | Dec 2, 2023 | DenoisingTask Planning | CodeCode Available | 1 |
| Physical Reasoning and Object Planning for Household Embodied Agents | Nov 22, 2023 | 2kDecision Making | CodeCode Available | 1 |
| Kinematic-aware Prompting for Generalizable Articulated Object Manipulation with LLMs | Nov 6, 2023 | Imitation LearningIn-Context Learning | CodeCode Available | 1 |
| Vision-Language Interpreter for Robot Task Planning | Nov 2, 2023 | Robot Task PlanningTask Planning | CodeCode Available | 1 |
| New Interaction Paradigm for Complex EDA Software Leveraging GPT | Jul 27, 2023 | Task Planning | CodeCode Available | 1 |
| Embodied Task Planning with Large Language Models | Jul 4, 2023 | Task Planning | CodeCode Available | 1 |
| SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models | May 30, 2023 | BenchmarkingCode Generation | CodeCode Available | 1 |
| Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds | May 27, 2023 | Task PlanningWorld Knowledge | CodeCode Available | 1 |
| A Multi-modal Garden Dataset and Hybrid 3D Dense Reconstruction Framework Based on Panoramic Stereo Images for a Trimming Robot | May 10, 2023 | Task Planning | CodeCode Available | 1 |
| TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs | Jul 11, 2022 | BenchmarkingRepresentation Learning | CodeCode Available | 1 |
| Sequential Manipulation Planning on Scene Graph | Jul 10, 2022 | Object RearrangementStochastic Optimization | CodeCode Available | 1 |
| PlanSys2: A Planning System Framework for ROS2 | Jul 1, 2021 | Task Planning | CodeCode Available | 1 |
| Extended Tree Search for Robot Task and Motion Planning | Mar 9, 2021 | Decision MakingMotion Planning | CodeCode Available | 1 |
| Actionet: An Interactive End-To-End Platform For Task-Based Data Collection And Augmentation In 3D Environment | Oct 3, 2020 | Dataset GenerationTask Planning | CodeCode Available | 1 |
| Learning to combine primitive skills: A step towards versatile robotic manipulation | Aug 2, 2019 | Data AugmentationImitation Learning | CodeCode Available | 1 |
| MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification | Jun 26, 2025 | Image SegmentationLarge Language Model | —Unverified | 0 |
| VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models | Jun 21, 2025 | Action GenerationContinual Learning | —Unverified | 0 |
| Towards AI Search Paradigm | Jun 20, 2025 | Decision MakingRetrieval-augmented Generation | —Unverified | 0 |
| Multimodal Fused Learning for Solving the Generalized Traveling Salesman Problem in Robotic Task Planning | Jun 20, 2025 | Computational EfficiencyTask Planning | —Unverified | 0 |
| Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills | Jun 12, 2025 | Large Language ModelTask Planning | —Unverified | 0 |
| VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning | Jun 10, 2025 | Task PlanningVisual Reasoning | —Unverified | 0 |
| Language-Vision Planner and Executor for Text-to-Visual Reasoning | Jun 9, 2025 | In-Context LearningMME | —Unverified | 0 |
| Prime the search: Using large language models for guiding geometric task and motion planning by warm-starting tree search | Jun 8, 2025 | Common Sense ReasoningMotion Planning | CodeCode Available | 0 |
| RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks | Jun 7, 2025 | Large Language ModelTask Planning | —Unverified | 0 |