| A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) | Feb 5, 2025 | Hallucination, Spatial Reasoning | Unverified | 0 |
| 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow | Jan 28, 2025 | Instruction Following, Mixture-of-Experts | Unverified | 0 |
| PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding | Jan 27, 2025 | Benchmarking, Common Sense Reasoning | Unverified | 0 |
| Zero-shot Robotic Manipulation with Language-guided Instruction and Formal Task Planning | Jan 25, 2025 | Motion Planning, Task and Motion Planning | Unverified | 0 |
| Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation | Jan 21, 2025 | Task Planning | Unverified | 0 |
| SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning | Jan 17, 2025 | Spatial Reasoning, Task Planning | Unverified | 0 |
| VLM-driven Behavior Tree for Context-aware Task Planning | Jan 7, 2025 | Task Planning | Code Available | 1 |
| Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model | Dec 30, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning | Dec 27, 2024 | counterfactual, Hierarchical Reinforcement Learning | Unverified | 0 |
| A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs | Dec 24, 2024 | All, Task Planning | Unverified | 0 |
| Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples | Dec 23, 2024 | Common Sense Reasoning, Task Planning | Code Available | 1 |
| GraphAgent: Agentic Graph Language Assistant | Dec 22, 2024 | Knowledge Graphs, Node Classification | Code Available | 0 |
| Tree-of-Code: A Hybrid Approach for Robust Complex Task Planning and Execution | Dec 18, 2024 | Code Generation, Task Planning | Unverified | 0 |
| From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle | Dec 17, 2024 | AI Agent, Formal Logic | Unverified | 0 |
| SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents | Dec 17, 2024 | Task Planning | Code Available | 2 |
| Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning | Dec 16, 2024 | Hallucination, Robot Manipulation | Code Available | 2 |
| Ontology-driven Prompt Tuning for LLM-based Task and Motion Planning | Dec 10, 2024 | Motion Planning, Task and Motion Planning | Unverified | 0 |
| HyperGraphOS: A Meta Operating System for Science and Engineering | Dec 6, 2024 | Code Generation, Management | Unverified | 0 |
| DataLab: A Unified Platform for LLM-Powered Business Intelligence | Dec 3, 2024 | Large Language Model, Task Planning | Unverified | 0 |
| RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World | Nov 29, 2024 | Robot Task Planning, Scheduling | Code Available | 2 |
| One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering | Nov 29, 2024 | Benchmarking, Object | Unverified | 0 |
| Time is on my sight: scene graph filtering for dynamic environment perception in an LLM-driven robot | Nov 22, 2024 | Object Localization, Task Planning | Unverified | 0 |
| Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation | Nov 18, 2024 | Knowledge Graphs, Robot Manipulation | Code Available | 0 |
| VeriGraph: Scene Graphs for Execution Verifiable Robot Planning | Nov 15, 2024 | Robot Task Planning, Task Planning | Unverified | 0 |
| WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models | Nov 8, 2024 | Task Planning, Zero-shot Generalization | Code Available | 2 |