SOTAVerified

Decision Making

Papers

Showing 761–770 of 12,311 papers

Title | Status | Hype
DORA: Exploring Outlier Representations in Deep Neural Networks | Code | 1
ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration | Code | 1
Beyond Pixels: Enhancing LIME with Hierarchical Features and Segmentation Foundation Models | Code | 1
LegalAgentBench: Evaluating LLM Agents in Legal Domain | Code | 1
Collective Intelligence in Human-AI Teams: A Bayesian Theory of Mind Approach | Code | 1
Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values | Code | 1
Efficient Nonmyopic Bayesian Optimization via One-Shot Multi-Step Trees | Code | 1
DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation | Code | 1
Active Inference and Behavior Trees for Reactive Action Planning and Execution in Robotics | Code | 1
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via Debate | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified