SOTAVerified

Task Planning

Papers

Showing 125 of 344 papers

TitleStatusHype
GTA1: GUI Test-time Scaling AgentCode2
MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification0
VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models0
Towards AI Search Paradigm0
Multimodal Fused Learning for Solving the Generalized Traveling Salesman Problem in Robotic Task Planning0
A Comprehensive Survey of Deep Research: Systems, Methodologies, and ApplicationsCode3
Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills0
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning0
Language-Vision Planner and Executor for Text-to-Visual Reasoning0
Prime the search: Using large language models for guiding geometric task and motion planning by warm-starting tree searchCode0
RoboPARA: Dual-Arm Robot Planning with Parallel Allocation and Recomposition Across Tasks0
Hierarchical Debate-Based Large Language Model (LLM) for Complex Task Planning of 6G Network Management0
Understanding Physical Properties of Unseen Deformable Objects by Leveraging Large Language Models and Robot Actions0
ChemGraph: An Agentic Framework for Computational Chemistry Workflows0
FlySearch: Exploring how vision-language models exploreCode1
Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs0
Grounded Vision-Language Interpreter for Integrated Task and Motion Planning0
LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks0
MASTER: Multi-Agent Security Through Exploration of Roles and Topological Structures -- A Comprehensive Framework0
BEDI: A Comprehensive Benchmark for Evaluating Embodied Agents on UAVsCode1
CRAKEN: Cybersecurity LLM Agent with Knowledge-Based ExecutionCode1
Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets0
Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent0
APEX: Empowering LLMs with Physics-Based Task Planning for Real-time InsightCode0
REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning?0
Show:102550
← PrevPage 1 of 14Next →

No leaderboard results yet.