SOTAVerified

Decision Making

Papers

Showing 8190 of 12311 papers

TitleStatusHype
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator TrajectoriesCode2
What Makes a Good Diffusion Planner for Decision Making?Code2
Digital Player: Evaluating Large Language Models based Human-like Agent in GamesCode2
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision SupportCode2
Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First TimeCode2
On the Guidance of Flow MatchingCode2
LeapVAD: A Leap in Autonomous Driving via Cognitive Perception and Dual-Process ThinkingCode2
OptiChat: Bridging Optimization Models and Practitioners with Large Language ModelsCode2
Mechanistic understanding and validation of large AI models with SemanticLensCode2
UAV-VLA: Vision-Language-Action System for Large Scale Aerial Mission GenerationCode2
Show:102550
← PrevPage 9 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified