SOTAVerified

Task Planning

Papers

Showing 2650 of 344 papers

TitleStatusHype
LLM3:Large Language Model-based Task and Motion Planning with Motion Failure ReasoningCode2
LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied AgentsCode2
TrustAgent: Towards Safe and Trustworthy LLM-based AgentsCode2
Getting pwn'd by AI: Penetration Testing with Large Language ModelsCode2
SkiROS2: A skill-based Robot Control Platform for ROSCode2
Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task AgentsCode2
FlySearch: Exploring how vision-language models exploreCode1
BEDI: A Comprehensive Benchmark for Evaluating Embodied Agents on UAVsCode1
CRAKEN: Cybersecurity LLM Agent with Knowledge-Based ExecutionCode1
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household RoboticsCode1
Enhancing LLM-Based Agents via Global Planning and Hierarchical ExecutionCode1
Data-Agnostic Robotic Long-Horizon Manipulation with Vision-Language-Guided Closed-Loop FeedbackCode1
LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition LanguageCode1
MRBTP: Efficient Multi-Robot Behavior Tree Planning and CollaborationCode1
Plan-over-Graph: Towards Parallelable LLM Agent ScheduleCode1
Robotouille: An Asynchronous Planning Benchmark for LLM AgentsCode1
Large Language Models for Multi-Robot Systems: A SurveyCode1
VLM-driven Behavior Tree for Context-aware Task PlanningCode1
Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few ExamplesCode1
Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence ModelingCode1
SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language ModelsCode1
EARBench: Towards Evaluating Physical Risk Awareness for Task Planning of Foundation Model-based Embodied AI AgentsCode1
Wonderful Team: Zero-Shot Physical Task Planning with Visual LLMsCode1
DISCO: Embodied Navigation and Interaction via Differentiable Scene Semantics and Dual-level ControlCode1
Can only LLMs do Reasoning?: Potential of Small Language Models in Task PlanningCode1
Show:102550
← PrevPage 2 of 14Next →

No leaderboard results yet.