SOTAVerified|Agents Browse Leaderboard About Blog

Task Planning

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11–20 of 344 papers

Title	Date	Tasks	Status	Hype
Remote Sensing ChatGPT: Solving Remote Sensing Tasks with ChatGPT and Visual Models	Jan 17, 2024	Task Planning	CodeCode Available	3
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent	Jan 14, 2024	Language ModellingLarge Language Model	CodeCode Available	3
GTA1: GUI Test-time Scaling Agent	Jul 8, 2025	Reinforcement Learning (RL)Task Planning	CodeCode Available	2
NavRAG: Generating User Demand Instructions for Embodied Navigation through Retrieval-Augmented LLM	Feb 16, 2025	NavigateRAG	CodeCode Available	2
D-CIPHER: Dynamic Collaborative Intelligent Multi-Agent System with Planner and Heterogeneous Executors for Offensive Security	Feb 15, 2025	Task Planning	CodeCode Available	2
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model	Dec 30, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents	Dec 17, 2024	Task Planning	CodeCode Available	2
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning	Dec 16, 2024	HallucinationRobot Manipulation	CodeCode Available	2
RoboMatrix: A Skill-centric Hierarchical Framework for Scalable Robot Task Planning and Execution in Open-World	Nov 29, 2024	Robot Task PlanningScheduling	CodeCode Available	2
WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models	Nov 8, 2024	Task PlanningZero-shot Generalization	CodeCode Available	2

Show:10 25 50

← PrevPage 2 of 35Next →

No leaderboard results yet.