SOTAVerified

Decision Making

Papers

Showing 24312440 of 12311 papers

TitleStatusHype
InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply ChainsCode1
Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent0
EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis0
Mask-Free Neuron Concept Annotation for Interpreting Neural Networks in Medical DomainCode0
Exploration Unbound0
Predicting Emotion Intensity in Polish Political Texts: Comparing Supervised Models and Large Language Models in a Resource-Poor LanguageCode0
UrbanWorld: An Urban World Model for 3D City GenerationCode2
Generally-Occurring Model Change for Robust Counterfactual Explanations0
Preemptive Detection and Correction of Misaligned Actions in LLM Agents0
How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language ModelsCode0
Show:102550
← PrevPage 244 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified