| Structured Preference Optimization for Vision-Language Long-Horizon Task Planning | Feb 28, 2025 | Task PlanningVisual Grounding | —Unverified | 0 |
| RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete | Feb 28, 2025 | Task PlanningTrajectory Prediction | —Unverified | 0 |
| MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration | Feb 25, 2025 | Robot Task PlanningTask Planning | CodeCode Available | 1 |
| RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents | Feb 23, 2025 | Task Planning | —Unverified | 0 |
| Plan-over-Graph: Towards Parallelable LLM Agent Schedule | Feb 20, 2025 | Task Planning | CodeCode Available | 1 |
| Towards Robust and Secure Embodied AI: A Survey on Vulnerabilities and Attacks | Feb 18, 2025 | Adversarial AttackAutonomous Vehicles | —Unverified | 0 |
| Scaling Autonomous Agents via Automatic Reward Modeling And Planning | Feb 17, 2025 | Decision MakingMathematical Problem-Solving | —Unverified | 0 |
| NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM | Feb 16, 2025 | NavigateRAG | CodeCode Available | 2 |
| OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning | Feb 16, 2025 | MedQAMMLU | —Unverified | 0 |
| D-CIPHER: Dynamic Collaborative Intelligent Multi-Agent System with Planner and Heterogeneous Executors for Offensive Security | Feb 15, 2025 | Task Planning | CodeCode Available | 2 |