| GTA1: GUI Test-time Scaling Agent | Jul 8, 2025 | Reinforcement Learning (RL)Task Planning | CodeCode Available | 2 |
| MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification | Jun 26, 2025 | Image SegmentationLarge Language Model | —Unverified | 0 |
| VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models | Jun 21, 2025 | Action GenerationContinual Learning | —Unverified | 0 |
| Towards AI Search Paradigm | Jun 20, 2025 | Decision MakingRetrieval-augmented Generation | —Unverified | 0 |
| Multimodal Fused Learning for Solving the Generalized Traveling Salesman Problem in Robotic Task Planning | Jun 20, 2025 | Computational EfficiencyTask Planning | —Unverified | 0 |
| A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications | Jun 14, 2025 | Information RetrievalSurvey | CodeCode Available | 3 |
| Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills | Jun 12, 2025 | Large Language ModelTask Planning | —Unverified | 0 |
| VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning | Jun 10, 2025 | Task PlanningVisual Reasoning | —Unverified | 0 |
| Language-Vision Planner and Executor for Text-to-Visual Reasoning | Jun 9, 2025 | In-Context LearningMME | —Unverified | 0 |
| Prime the search: Using large language models for guiding geometric task and motion planning by warm-starting tree search | Jun 8, 2025 | Common Sense ReasoningMotion Planning | CodeCode Available | 0 |
| RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks | Jun 7, 2025 | Large Language ModelTask Planning | —Unverified | 0 |
| Hierarchical Debate-Based Large Language Model (LLM) for Complex Task Planning of 6G Network Management | Jun 6, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Understanding Physical Properties of Unseen Deformable Objects by Leveraging Large Language Models and Robot Actions | Jun 4, 2025 | Motion PlanningTask and Motion Planning | —Unverified | 0 |
| ChemGraph: An Agentic Framework for Computational Chemistry Workflows | Jun 3, 2025 | Computational chemistryGraph Neural Network | —Unverified | 0 |
| Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs | Jun 3, 2025 | ObjectObject Rearrangement | —Unverified | 0 |
| FlySearch: Exploring how vision-language models explore | Jun 3, 2025 | HallucinationTask Planning | CodeCode Available | 1 |
| Grounded Vision-Language Interpreter for Integrated Task and Motion Planning | Jun 3, 2025 | Motion PlanningTask and Motion Planning | —Unverified | 0 |
| LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks | May 31, 2025 | Task PlanningVision-Language-Action | —Unverified | 0 |
| MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures -- A Comprehensive Framework | May 24, 2025 | Task Planning | —Unverified | 0 |
| BEDI: A Comprehensive Benchmark for Evaluating Embodied Agents on UAVs | May 23, 2025 | Model OptimizationTask Planning | CodeCode Available | 1 |
| CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution | May 21, 2025 | Large Language ModelTask Planning | CodeCode Available | 1 |
| Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets | May 21, 2025 | Dataset GenerationDescriptive | —Unverified | 0 |
| Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent | May 20, 2025 | Task Planning | —Unverified | 0 |
| APEX: Empowering LLMs with Physics-Based Task Planning for Real-time Insight | May 20, 2025 | Causal InferenceDecision Making | CodeCode Available | 0 |
| REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning? | May 16, 2025 | Large Language ModelRobot Task Planning | —Unverified | 0 |
| LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution | May 15, 2025 | Robot ManipulationTask Planning | —Unverified | 0 |
| Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM | May 13, 2025 | 16k8k | CodeCode Available | 0 |
| PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents | May 2, 2025 | Instruction FollowingResponse Generation | —Unverified | 0 |
| Leveraging Pre-trained Large Language Models with Refined Prompting for Online Task and Motion Planning | Apr 30, 2025 | Large Language ModelMotion Planning | —Unverified | 0 |
| LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Apr 30, 2025 | In-Context LearningObject | CodeCode Available | 1 |
| CoordField: Coordination Field for Agentic UAV Task Allocation In Low-altitude Urban Scenarios | Apr 30, 2025 | Task Planning | —Unverified | 0 |
| NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks | Apr 28, 2025 | Task PlanningVision-Language-Action | —Unverified | 0 |
| Robo-Troj: Attacking LLM-based Task Planners | Apr 23, 2025 | Backdoor AttackDiversity | —Unverified | 0 |
| Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution | Apr 23, 2025 | Task Planning | CodeCode Available | 1 |
| A Framework for Benchmarking and Aligning Task-Planning Safety in LLM-Based Embodied Agents | Apr 20, 2025 | BenchmarkingTask Planning | —Unverified | 0 |
| InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning | Apr 17, 2025 | Meta-LearningMeta Reinforcement Learning | —Unverified | 0 |
| FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment | Apr 11, 2025 | 3D geometryNatural Language Queries | —Unverified | 0 |
| Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents | Apr 1, 2025 | AI AgentTask Planning | CodeCode Available | 11 |
| Visual Environment-Interactive Planning for Embodied Complex-Question Answering | Apr 1, 2025 | Question AnsweringTask Planning | —Unverified | 0 |
| Personality-Driven Decision-Making in LLM-Based Autonomous Agents | Apr 1, 2025 | Decision MakingScheduling | —Unverified | 0 |
| Adaptive Interactive Navigation of Quadruped Robots using Large Language Models | Mar 29, 2025 | Motion PlanningTask Planning | —Unverified | 0 |
| REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot Manipulation | Mar 28, 2025 | Robot ManipulationTask Planning | —Unverified | 0 |
| Data-Agnostic Robotic Long-Horizon Manipulation with Vision-Language-Guided Closed-Loop Feedback | Mar 27, 2025 | Task Planning | CodeCode Available | 1 |
| Multi-agent Application System in Office Collaboration Scenarios | Mar 25, 2025 | Decision MakingTask Planning | —Unverified | 0 |
| LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language | Mar 21, 2025 | In-Context LearningRobot Task Planning | CodeCode Available | 1 |
| Safety Aware Task Planning via Large Language Models in Robotics | Mar 19, 2025 | Task Planning | —Unverified | 0 |
| Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs | Mar 19, 2025 | ObjectRobot Navigation | —Unverified | 0 |
| Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning | Mar 17, 2025 | Scene SegmentationTask Planning | —Unverified | 0 |
| Mitigating Cross-Modal Distraction and Ensuring Geometric Feasibility via Affordance-Guided, Self-Consistent MLLMs for Food Preparation Task Planning | Mar 17, 2025 | Collision AvoidanceIn-Context Learning | —Unverified | 0 |
| Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills | Mar 16, 2025 | Task Planning | —Unverified | 0 |