SOTAVerified

Decision Making

Papers

Showing 33913400 of 12311 papers

TitleStatusHype
Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis AgentsCode1
Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem0
TrafPS: A Shapley-based Visual Analytics Approach to Interpret Traffic0
End-to-end Conditional Robust Optimization0
A Survey on Human-AI Teaming with Large Pre-Trained Models0
Cooperative Bayesian Optimization for Imperfect Agents0
An Explainable AI Framework for Artificial Intelligence of Medical Things0
Levels of AI Agents: from Rules to Large Language Models0
Hitchhiker's guide to cancer-associated lymphoid aggregates in histology images: manual and deep learning-based quantification approaches0
Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame SimulationsCode1
Show:102550
← PrevPage 340 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified