SOTAVerified

Robot Task Planning

Papers

Showing 1-48 of 48 papers

Title | Status | Hype
REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning? |  | 0
LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language | Code | 1
Self-Corrective Task Planning by Inverse Prompting with Large Language Models |  | 0
MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration | Code | 1
Scalable Hierarchical Reinforcement Learning for Hyper Scale Multi-Robot Task Planning |  | 0
RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World | Code | 2
One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering |  | 0
VeriGraph: Scene Graphs for Execution Verifiable Robot Planning |  | 0
CLIMB: Language-Guided Continual Learning for Task Planning with Iterative Model Building |  | 0
VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model |  | 0
SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models | Code | 1
COHERENT: Collaboration of Heterogeneous Multi-Robot System with Large Language Models | Code | 2
Encoding Reusable Multi-Robot Planning Strategies as Abstract Hypergraphs |  | 0
Nl2Hltl2Plan: Scaling Up Natural Language Understanding for Multi-Robots Through Hierarchical Temporal Logic Task Representation |  | 0
DKPROMPT: Domain Knowledge Prompting Vision-Language Models for Open-World Planning |  | 0
DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models |  | 0
Natural Language as Policies: Reasoning for Coordinate-Level Embodied Control with LLMs |  | 0
SheetAgent: Towards A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models |  | 0
Large Language Models for Robotics: Opportunities, Challenges, and Perspectives |  | 0
How to Raise a Robot -- A Case for Neuro-Symbolic AI in Constrained Task Planning for Humanoid Assistive Robots |  | 0
Sequential Planning in Large Partially Observable Environments guided by LLMs | Code | 1
Vision-Language Interpreter for Robot Task Planning | Code | 1
SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning |  | 0
REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction | Code | 1
Robot Task Planning Based on Large Language Model Representing Knowledge with Directed Graph Structures | Code | 0
SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models | Code | 1
Parsel: Algorithmic Reasoning with Language Models by Composing Decompositions | Code | 2
Entropy Rate Maximization of Markov Decision Processes under Linear Temporal Logic Tasks |  | 0
You Don't Know When I Will Arrive: Unpredictable Controller Synthesis for Temporal Logic Tasks |  | 0
Robot Task Planning and Situation Handling in Open Worlds |  | 0
BusyBot: Learning to Interact, Reason, and Plan in a BusyBoard Environment | Code | 1
TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs | Code | 1
0/1 Deep Neural Networks via Block Coordinate Descent |  | 0
Do As I Can, Not As I Say: Grounding Language in Robotic Affordances | Code | 2
You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration | Code | 2
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | Code | 2
Using Human-Guided Causal Knowledge for More Generalized Robot Task Planning |  | 0
CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation | Code | 1
Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation | Code | 1
PackIt: A Virtual Environment for Geometric Planning | Code | 1
3D Dynamic Scene Graphs: Actionable Spatial Perception with Places, Objects, and Humans | Code | 2
Task Planning with a Weighted Functional Object-Oriented Network | Code | 0
The CoSTAR Block Stacking Dataset: Learning with Workspace Constraints | Code | 0
Task Planning in Robotics: an Empirical Comparison of PDDL-based and ASP-based Systems |  | 0
Visual Robot Task Planning | Code | 0
Learning to Imagine Manipulation Goals for Robot Task Planning |  | 0
Inferring Forces and Learning Human Utilities From Videos |  | 0
Plan Explicability and Predictability for Robot Task Planning |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PackNN | Average Reward | 64.9 |  | Unverified
2 | Heuristic Largest First-Aligned-BLBF | Average Reward | 59.2 |  | Unverified
3 | Heuristic Largest First-Aligned-Random | Average Reward | 49.4 |  | Unverified
4 | Heuristic Random-Aligned-BLBF | Average Reward | 41.9 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SheetAgent (GPT-3.5) | Pass@1 | 61.1 |  | Unverified
2 | SheetCopilot (NIPS2023) | Pass@1 | 44.3 |  | Unverified