| MMRo: Are Multimodal LLMs Eligible as the Brain for In-Home Robotics? | Jun 28, 2024 | Task PlanningVisual Reasoning | —Unverified | 0 |
| DKPROMPT: Domain Knowledge Prompting Vision-Language Models for Open-World Planning | Jun 25, 2024 | Robot Task PlanningTask Planning | —Unverified | 0 |
| Automating Transfer of Robot Task Plans using Functorial Data Migrations | Jun 22, 2024 | Task Planning | —Unverified | 0 |
| Diffusion-Based Failure Sampling for Evaluating Safety-Critical Autonomous Systems | Jun 20, 2024 | DenoisingTask Planning | CodeCode Available | 0 |
| Embodied Instruction Following in Unknown Environments | Jun 17, 2024 | Instruction FollowingTask Planning | —Unverified | 0 |
| DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning | Jun 14, 2024 | Task Planning | —Unverified | 0 |
| Details Make a Difference: Object State-Sensitive Neurorobotic Task Planning | Jun 14, 2024 | Dense CaptioningObject | CodeCode Available | 0 |
| RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent | Jun 11, 2024 | AI AgentDescriptive | CodeCode Available | 2 |
| NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security | Jun 8, 2024 | Task PlanningVulnerability Detection | CodeCode Available | 11 |
| Tool-Planner: Task Planning with Clusters across Multiple Tools | Jun 6, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| CLMASP: Coupling Large Language Models with Answer Set Programming for Robotic Task Planning | Jun 5, 2024 | Task Planning | —Unverified | 0 |
| From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems | May 30, 2024 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Can Graph Learning Improve Planning in LLM-based Agents? | May 29, 2024 | Decision MakingGraph Learning | CodeCode Available | 2 |
| Tool Learning with Large Language Models: A Survey | May 28, 2024 | Response GenerationSurvey | CodeCode Available | 3 |
| Planning with Multi-Constraints via Collaborative Language Agents | May 26, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games | May 22, 2024 | Code GenerationDecision Making | —Unverified | 0 |
| ManiFoundation Model for General-Purpose Robotic Manipulation of Contact Synthesis with Arbitrary Objects and Robots | May 11, 2024 | DiversityObject | —Unverified | 0 |
| LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots | Apr 22, 2024 | Imitation LearningTask Planning | —Unverified | 0 |
| Socratic Planner: Self-QA-Based Zero-Shot Planning for Embodied Instruction Following | Apr 21, 2024 | In-Context LearningInstruction Following | —Unverified | 0 |
| Learning Symbolic Task Representation from a Human-Led Demonstration: A Memory to Store, Retrieve, Consolidate, and Forget Experiences | Apr 16, 2024 | One-Shot LearningTask Planning | CodeCode Available | 0 |
| Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V | Apr 16, 2024 | Instruction FollowingMultimodal Reasoning | —Unverified | 0 |
| VoicePilot: Harnessing LLMs as Speech Interfaces for Physically Assistive Robots | Apr 5, 2024 | Code GenerationTask Planning | —Unverified | 0 |
| Can only LLMs do Reasoning?: Potential of Small Language Models in Task Planning | Apr 5, 2024 | Task Planning | CodeCode Available | 1 |
| DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models | Apr 4, 2024 | Common Sense ReasoningComputational Efficiency | —Unverified | 0 |
| A Survey of Optimization-based Task and Motion Planning: From Classical To Learning Approaches | Apr 3, 2024 | Motion PlanningSurvey | —Unverified | 0 |
| Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods | Mar 30, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning | Mar 29, 2024 | HallucinationTask Planning | CodeCode Available | 0 |
| Prioritize Team Actions: Multi-Agent Temporal Logic Task Planning with Ordering Constraints | Mar 26, 2024 | Task Planning | —Unverified | 0 |
| TwoStep: Multi-agent Task Planning using Classical Planners and Large Language Models | Mar 25, 2024 | Task Planning | —Unverified | 0 |
| GOLF: Goal-Oriented Long-term liFe tasks supported by human-AI collaboration | Mar 25, 2024 | Decision MakingInformation Retrieval | —Unverified | 0 |
| Learning Hierarchical Control Systems for Autonomous Systems with Energy Constraints | Mar 21, 2024 | energy managementManagement | —Unverified | 0 |
| Natural Language as Policies: Reasoning for Coordinate-Level Embodied Control with LLMs | Mar 20, 2024 | Logical ReasoningPrompt Engineering | —Unverified | 0 |
| LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning | Mar 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| FLAP: Flow-Adhering Planning with Constrained Decoding in LLMs | Mar 9, 2024 | Task Planning | —Unverified | 0 |
| RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Mar 8, 2024 | Code GenerationHallucination | CodeCode Available | 3 |
| Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation | Mar 6, 2024 | PositionTask Planning | —Unverified | 0 |
| Optimal Integrated Task and Path Planning and Its Application to Multi-Robot Pickup and Delivery | Mar 2, 2024 | Task Planning | —Unverified | 0 |
| Probabilistically Correct Language-based Multi-Robot Planning using Conformal Prediction | Feb 23, 2024 | Conformal PredictionTask Planning | —Unverified | 0 |
| RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation | Feb 22, 2024 | Code GenerationCommon Sense Reasoning | —Unverified | 0 |
| WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment | Feb 19, 2024 | Program SynthesisTask Planning | —Unverified | 0 |
| AutoGPT+P: Affordance-based Task Planning with Large Language Models | Feb 16, 2024 | object-detectionObject Detection | —Unverified | 0 |
| LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents | Feb 13, 2024 | BenchmarkingModel Selection | CodeCode Available | 2 |
| Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models | Feb 12, 2024 | HallucinationObject Localization | CodeCode Available | 4 |
| Belief Scene Graphs: Expanding Partial Scenes with Objects through Computation of Expectation | Feb 6, 2024 | Common Sense ReasoningTask Planning | —Unverified | 0 |
| TrustAgent: Towards Safe and Trustworthy LLM-based Agents | Feb 2, 2024 | Task Planning | CodeCode Available | 2 |
| Learning to Visually Connect Actions and their Effects | Jan 19, 2024 | Object TrackingTask Planning | —Unverified | 0 |
| Remote Sensing ChatGPT: Solving Remote Sensing Tasks with ChatGPT and Visual Models | Jan 17, 2024 | Task Planning | CodeCode Available | 3 |
| Consolidating Trees of Robotic Plans Generated Using Large Language Models to Improve Reliability | Jan 15, 2024 | Task Planning | —Unverified | 0 |
| Small LLMs Are Weak Tool Learners: A Multi-LLM Agent | Jan 14, 2024 | Language ModellingLarge Language Model | CodeCode Available | 3 |
| Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security | Jan 10, 2024 | Task Planning | CodeCode Available | 5 |