| The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare | Nov 5, 2024 | Task Planning | —Unverified | 0 |
| A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics | Oct 30, 2024 | General KnowledgePrompt Engineering | CodeCode Available | 0 |
| EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents | Oct 30, 2024 | Large Language ModelObject Rearrangement | —Unverified | 0 |
| Optimal planning for heterogeneous autonomous teams with precedence and compatibility constraints and its application on power grid inspection with Unmanned Aerial Vehicles | Oct 28, 2024 | Task PlanningTraveling Salesman Problem | CodeCode Available | 0 |
| MaCTG: Multi-Agent Collaborative Thought Graph for Automatic Programming | Oct 25, 2024 | Code GenerationHallucination | —Unverified | 0 |
| FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning | Oct 25, 2024 | graph constructionRAG | —Unverified | 0 |
| VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use | Oct 21, 2024 | Image CaptioningTask Planning | —Unverified | 0 |
| RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents | Oct 17, 2024 | Question AnsweringTask Planning | —Unverified | 0 |
| CLIMB: Language-Guided Continual Learning for Task Planning with Iterative Model Building | Oct 17, 2024 | Continual LearningDescriptive | —Unverified | 0 |
| AT-MoE: Adaptive Task-planning Mixture of Experts via LoRA Approach | Oct 12, 2024 | Mixture-of-ExpertsTask Planning | —Unverified | 0 |
| VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model | Oct 11, 2024 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| Agent S: An Open Agentic Framework that Uses Computers Like a Human | Oct 10, 2024 | AI AgentTask Planning | CodeCode Available | 11 |
| ConceptAgent: LLM-Driven Precondition Grounding and Tree Search for Robust Task Planning and Execution | Oct 8, 2024 | Common Sense ReasoningLogical Fallacies | —Unverified | 0 |
| ET-Plan-Bench: Embodied Task-level Planning Benchmark Towards Spatial-Temporal Cognition with Foundation Models | Oct 2, 2024 | DiagnosticTask Planning | —Unverified | 0 |
| Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy | Oct 2, 2024 | Motion PlanningRobot Manipulation | CodeCode Available | 2 |
| LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner | Sep 30, 2024 | Heuristic SearchLanguage Modeling | —Unverified | 0 |
| SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models | Sep 28, 2024 | Drone navigationRobot Manipulation | CodeCode Available | 1 |
| An Epistemic Human-Aware Task Planner which Anticipates Human Beliefs and Decisions | Sep 27, 2024 | Task Planning | —Unverified | 0 |
| COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models | Sep 23, 2024 | Robot Task PlanningTask Planning | CodeCode Available | 2 |
| KARMA: Augmenting Embodied AI Agents with Long-and-short Term Memory Systems | Sep 23, 2024 | AI AgentTask Planning | CodeCode Available | 0 |
| AlignBot: Aligning VLM-powered Customized Task Planning with User Reminders Through Fine-Tuning for Household Robots | Sep 18, 2024 | Task Planning | —Unverified | 0 |
| LEMMo-Plan: LLM-Enhanced Learning from Multi-Modal Demonstration for Planning Sequential Contact-Rich Manipulation Tasks | Sep 18, 2024 | Contact-rich ManipulationIn-Context Learning | —Unverified | 0 |
| P-RAG: Progressive Retrieval Augmented Generation For Planning on Embodied Everyday Task | Sep 17, 2024 | Large Language ModelRAG | —Unverified | 0 |
| SIFToM: Robust Spoken Instruction Following through Theory of Mind | Sep 17, 2024 | Instruction FollowingTask Planning | —Unverified | 0 |
| Encoding Reusable Multi-Robot Planning Strategies as Abstract Hypergraphs | Sep 16, 2024 | Robot Task PlanningTask Planning | —Unverified | 0 |
| Relevance for Human Robot Collaboration | Sep 12, 2024 | Dimensionality ReductionScene Understanding | —Unverified | 0 |
| Scalable Task Planning via Large Language Models and Structured World Representations | Sep 7, 2024 | Task Planning | —Unverified | 0 |
| EMPOWER: Embodied Multi-role Open-vocabulary Planning with Online Grounding and Execution | Aug 30, 2024 | Task Planning | —Unverified | 0 |
| AeroVerse: UAV-Agent Benchmark Suite for Simulating, Pre-training, Finetuning, and Evaluating Aerospace Embodied World Models | Aug 28, 2024 | Spatial ReasoningTask Planning | —Unverified | 0 |
| LLM-enhanced Scene Graph Learning for Household Rearrangement | Aug 22, 2024 | Common Sense ReasoningGraph Learning | —Unverified | 0 |
| Plan with Code: Comparing approaches for robust NL to DSL generation | Aug 15, 2024 | Code GenerationHallucination | —Unverified | 0 |
| General-purpose Clothes Manipulation with Semantic Keypoints | Aug 15, 2024 | Action GenerationLanguage Modeling | —Unverified | 0 |
| Autonomous Behavior Planning For Humanoid Loco-manipulation Through Grounded Language Model | Aug 15, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Nl2Hltl2Plan: Scaling Up Natural Language Understanding for Multi-Robots Through Hierarchical Temporal Logic Task Representation | Aug 15, 2024 | Natural Language UnderstandingRobot Task Planning | —Unverified | 0 |
| Retrieval-Augmented Hierarchical in-Context Reinforcement Learning and Hindsight Modular Reflections for Task Planning with LLMs | Aug 12, 2024 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Multi-Agent Planning Using Visual Language Models | Aug 10, 2024 | Task Planning | —Unverified | 0 |
| EARBench: Towards Evaluating Physical Risk Awareness for Task Planning of Foundation Model-based Embodied AI Agents | Aug 8, 2024 | Scene GenerationTask Planning | CodeCode Available | 1 |
| Towards Coarse-grained Visual Language Navigation Task Planning Enhanced by Event Knowledge Graph | Aug 5, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| EPD: Long-term Memory Extraction, Context-awared Planning and Multi-iteration Decision @ EgoPlan Challenge ICML 2024 | Jul 28, 2024 | Decision MakingTask Planning | CodeCode Available | 0 |
| Wonderful Team: Zero-Shot Physical Task Planning with Visual LLMs | Jul 26, 2024 | Action GenerationLarge Language Model | CodeCode Available | 1 |
| WorkR: Occupation Inference for Intelligent Task Assistance | Jul 26, 2024 | ManagementTask Planning | —Unverified | 0 |
| DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level Control | Jul 20, 2024 | Instruction FollowingNavigate | CodeCode Available | 1 |
| AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases | Jul 17, 2024 | Autonomous DrivingBackdoor Attack | CodeCode Available | 3 |
| BadRobot: Jailbreaking Embodied LLMs in the Physical World | Jul 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An Optimal Task Planning and Agent-aware Allocation Algorithm in Collaborative Tasks Combining with PDDL and POPF | Jul 11, 2024 | Task Planning | —Unverified | 0 |
| Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation | Jul 8, 2024 | Decision MakingMotion Planning | —Unverified | 0 |
| This&That: Language-Gesture Controlled Video Generation for Robot Planning | Jul 8, 2024 | Task PlanningVideo Generation | —Unverified | 0 |
| MFE-ETP: A Comprehensive Evaluation Benchmark for Multi-modal Foundation Models on Embodied Task Planning | Jul 6, 2024 | Embodied Question AnsweringQuestion Answering | CodeCode Available | 0 |
| MARLIN: A Cloud Integrated Robotic Solution to Support Intralogistics in Retail | Jul 2, 2024 | Autonomous NavigationTask Planning | —Unverified | 0 |