| Multi-agent Application System in Office Collaboration Scenarios | Mar 25, 2025 | Decision MakingTask Planning | —Unverified | 0 |
| Safety Aware Task Planning via Large Language Models in Robotics | Mar 19, 2025 | Task Planning | —Unverified | 0 |
| Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs | Mar 19, 2025 | ObjectRobot Navigation | —Unverified | 0 |
| Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning | Mar 17, 2025 | Scene SegmentationTask Planning | —Unverified | 0 |
| Mitigating Cross-Modal Distraction and Ensuring Geometric Feasibility via Affordance-Guided, Self-Consistent MLLMs for Food Preparation Task Planning | Mar 17, 2025 | Collision AvoidanceIn-Context Learning | —Unverified | 0 |
| Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills | Mar 16, 2025 | Task Planning | —Unverified | 0 |
| World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning | Mar 13, 2025 | Task Planning | —Unverified | 0 |
| SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery | Mar 12, 2025 | Activity RecognitionAnatomy | —Unverified | 0 |
| Investigating the Effectiveness of a Socratic Chain-of-Thoughts Reasoning Method for Task Planning in Robotics, A Case Study | Mar 11, 2025 | Code GenerationTask Planning | —Unverified | 0 |
| General-Purpose Aerial Intelligent Agents Empowered by Large Language Models | Mar 11, 2025 | Motion PlanningScene Understanding | —Unverified | 0 |
| Graphormer-Guided Task Planning: Beyond Static Rules with LLM Safety Perception | Mar 10, 2025 | Task Planning | CodeCode Available | 0 |
| Self-Corrective Task Planning by Inverse Prompting with Large Language Models | Mar 10, 2025 | Robot Task PlanningTask Planning | —Unverified | 0 |
| STAR: A Foundation Model-driven Framework for Robust Task Planning and Failure Recovery in Robotic Systems | Mar 8, 2025 | Information RetrievalKnowledge Graphs | —Unverified | 0 |
| Safe LLM-Controlled Robots with Formal Guarantees via Reachability Analysis | Mar 5, 2025 | Autonomous NavigationNavigate | CodeCode Available | 0 |
| Improving Retrospective Language Agents via Joint Policy Gradient Optimization | Mar 3, 2025 | Decision MakingImitation Learning | —Unverified | 0 |
| CLEA: Closed-Loop Embodied Agent for Enhancing Task Execution in Dynamic Environments | Mar 2, 2025 | Task Planning | CodeCode Available | 0 |
| Structured Preference Optimization for Vision-Language Long-Horizon Task Planning | Feb 28, 2025 | Task PlanningVisual Grounding | —Unverified | 0 |
| RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete | Feb 28, 2025 | Task PlanningTrajectory Prediction | —Unverified | 0 |
| RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents | Feb 23, 2025 | Task Planning | —Unverified | 0 |
| Towards Robust and Secure Embodied AI: A Survey on Vulnerabilities and Attacks | Feb 18, 2025 | Adversarial AttackAutonomous Vehicles | —Unverified | 0 |
| Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Feb 17, 2025 | Decision MakingMathematical Problem-Solving | —Unverified | 0 |
| OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning | Feb 16, 2025 | MedQAMMLU | —Unverified | 0 |
| STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning | Feb 14, 2025 | Decision MakingSpatial Reasoning | —Unverified | 0 |
| Vote-Tree-Planner: Optimizing Execution Order in LLM-based Task Planning Pipeline via Voting | Feb 13, 2025 | Decision MakingTask Planning | —Unverified | 0 |
| 3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning | Feb 13, 2025 | Code GenerationScene Understanding | —Unverified | 0 |
| A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) | Feb 5, 2025 | HallucinationSpatial Reasoning | —Unverified | 0 |
| 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow | Jan 28, 2025 | Instruction FollowingMixture-of-Experts | —Unverified | 0 |
| PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding | Jan 27, 2025 | BenchmarkingCommon Sense Reasoning | —Unverified | 0 |
| Zero-shot Robotic Manipulation with Language-guided Instruction and Formal Task Planning | Jan 25, 2025 | Motion PlanningTask and Motion Planning | —Unverified | 0 |
| Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation | Jan 21, 2025 | Task Planning | —Unverified | 0 |
| SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning | Jan 17, 2025 | Spatial ReasoningTask Planning | —Unverified | 0 |
| Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning | Dec 27, 2024 | counterfactualHierarchical Reinforcement Learning | —Unverified | 0 |
| A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs | Dec 24, 2024 | AllTask Planning | —Unverified | 0 |
| GraphAgent: Agentic Graph Language Assistant | Dec 22, 2024 | Knowledge GraphsNode Classification | CodeCode Available | 0 |
| Tree-of-Code: A Hybrid Approach for Robust Complex Task Planning and Execution | Dec 18, 2024 | Code GenerationTask Planning | —Unverified | 0 |
| From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle | Dec 17, 2024 | AI AgentFormal Logic | —Unverified | 0 |
| Ontology-driven Prompt Tuning for LLM-based Task and Motion Planning | Dec 10, 2024 | Motion PlanningTask and Motion Planning | —Unverified | 0 |
| HyperGraphOS: A Meta Operating System for Science and Engineering | Dec 6, 2024 | Code GenerationManagement | —Unverified | 0 |
| DataLab: A Unified Platform for LLM-Powered Business Intelligence | Dec 3, 2024 | Large Language ModelTask Planning | —Unverified | 0 |
| One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering | Nov 29, 2024 | BenchmarkingObject | —Unverified | 0 |
| Time is on my sight: scene graph filtering for dynamic environment perception in an LLM-driven robot | Nov 22, 2024 | Object LocalizationTask Planning | —Unverified | 0 |
| Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation | Nov 18, 2024 | Knowledge GraphsRobot Manipulation | CodeCode Available | 0 |
| VeriGraph: Scene Graphs for Execution Verifiable Robot Planning | Nov 15, 2024 | Robot Task PlanningTask Planning | —Unverified | 0 |
| The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare | Nov 5, 2024 | Task Planning | —Unverified | 0 |
| A Comparison of Prompt Engineering Techniques for Task Planning and Execution in Service Robotics | Oct 30, 2024 | General KnowledgePrompt Engineering | CodeCode Available | 0 |
| EMOS: Embodiment-aware Heterogeneous Multi-robot Operating System with LLM Agents | Oct 30, 2024 | Large Language ModelObject Rearrangement | —Unverified | 0 |
| Optimal planning for heterogeneous autonomous teams with precedence and compatibility constraints and its application on power grid inspection with Unmanned Aerial Vehicles | Oct 28, 2024 | Task PlanningTraveling Salesman Problem | CodeCode Available | 0 |
| FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning | Oct 25, 2024 | graph constructionRAG | —Unverified | 0 |
| MaCTG: Multi-Agent Collaborative Thought Graph for Automatic Programming | Oct 25, 2024 | Code GenerationHallucination | —Unverified | 0 |
| VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use | Oct 21, 2024 | Image CaptioningTask Planning | —Unverified | 0 |