| World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning | Mar 13, 2025 | Task Planning | —Unverified | 0 |
| SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery | Mar 12, 2025 | Activity RecognitionAnatomy | —Unverified | 0 |
| General-Purpose Aerial Intelligent Agents Empowered by Large Language Models | Mar 11, 2025 | Motion PlanningScene Understanding | —Unverified | 0 |
| Investigating the Effectiveness of a Socratic Chain-of-Thoughts Reasoning Method for Task Planning in Robotics, A Case Study | Mar 11, 2025 | Code GenerationTask Planning | —Unverified | 0 |
| Self-Corrective Task Planning by Inverse Prompting with Large Language Models | Mar 10, 2025 | Robot Task PlanningTask Planning | —Unverified | 0 |
| Graphormer-Guided Task Planning: Beyond Static Rules with LLM Safety Perception | Mar 10, 2025 | Task Planning | CodeCode Available | 0 |
| STAR: A Foundation Model-driven Framework for Robust Task Planning and Failure Recovery in Robotic Systems | Mar 8, 2025 | Information RetrievalKnowledge Graphs | —Unverified | 0 |
| Safe LLM-Controlled Robots with Formal Guarantees via Reachability Analysis | Mar 5, 2025 | Autonomous NavigationNavigate | CodeCode Available | 0 |
| Improving Retrospective Language Agents via Joint Policy Gradient Optimization | Mar 3, 2025 | Decision MakingImitation Learning | —Unverified | 0 |
| CLEA: Closed-Loop Embodied Agent for Enhancing Task Execution in Dynamic Environments | Mar 2, 2025 | Task Planning | CodeCode Available | 0 |
| Structured Preference Optimization for Vision-Language Long-Horizon Task Planning | Feb 28, 2025 | Task PlanningVisual Grounding | —Unverified | 0 |
| RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete | Feb 28, 2025 | Task PlanningTrajectory Prediction | —Unverified | 0 |
| MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration | Feb 25, 2025 | Robot Task PlanningTask Planning | CodeCode Available | 1 |
| RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents | Feb 23, 2025 | Task Planning | —Unverified | 0 |
| Plan-over-Graph: Towards Parallelable LLM Agent Schedule | Feb 20, 2025 | Task Planning | CodeCode Available | 1 |
| Towards Robust and Secure Embodied AI: A Survey on Vulnerabilities and Attacks | Feb 18, 2025 | Adversarial AttackAutonomous Vehicles | —Unverified | 0 |
| Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Feb 17, 2025 | Decision MakingMathematical Problem-Solving | —Unverified | 0 |
| NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM | Feb 16, 2025 | NavigateRAG | CodeCode Available | 2 |
| OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning | Feb 16, 2025 | MedQAMMLU | —Unverified | 0 |
| D-CIPHER: Dynamic Collaborative Intelligent Multi-Agent System with Planner and Heterogeneous Executors for Offensive Security | Feb 15, 2025 | Task Planning | CodeCode Available | 2 |
| STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning | Feb 14, 2025 | Decision MakingSpatial Reasoning | —Unverified | 0 |
| Vote-Tree-Planner: Optimizing Execution Order in LLM-based Task Planning Pipeline via Voting | Feb 13, 2025 | Decision MakingTask Planning | —Unverified | 0 |
| 3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning | Feb 13, 2025 | Code GenerationScene Understanding | —Unverified | 0 |
| Robotouille: An Asynchronous Planning Benchmark for LLM Agents | Feb 6, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Models for Multi-Robot Systems: A Survey | Feb 6, 2025 | Action GenerationBenchmarking | CodeCode Available | 1 |