SOTAVerified

Decision Making

Papers

Showing 5176-5200 of 12311 papers

Title | Status | Hype
AgentCF: Collaborative Learning with Autonomous Language Agents for Recommender Systems | - | 0
LLaMA Rider: Spurring Large Language Models to Explore the Open World | - | 0
Efficient Apple Maturity and Damage Assessment: A Lightweight Detection Model with GAN and Attention Mechanism | - | 0
Bad Values but Good Behavior: Learning Highly Misspecified Bandits and MDPs | - | 0
The Impact of Explanations on Fairness in Human-AI Decision-Making: Protected vs Proxy Features | - | 0
Tightening Bounds on Probabilities of Causation By Merging Datasets | - | 0
Visual Attention Prompted Prediction and Learning | Code | 0
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models | - | 0
Towards Running Time Analysis of Interactive Multi-objective Evolutionary Algorithms | - | 0
Risk-informed Resilience Planning of Transmission Systems Against Ice Storms | - | 0
XAI Benchmark for Visual Explanation | - | 0
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios | - | 0
Novelty Detection in Reinforcement Learning with World Models | - | 0
Examining the Potential and Pitfalls of ChatGPT in Science and Engineering Problem-Solving | - | 0
Generative Intrinsic Optimization: Intrinsic Control with Model Learning | - | 0
Learning Transferable Conceptual Prototypes for Interpretable Unsupervised Domain Adaptation | - | 0
Question Answering for Electronic Health Records: A Scoping Review of datasets and models | - | 0
If our aim is to build morality into an artificial agent, how might we begin to go about doing so? | - | 0
Receive, Reason, and React: Drive as You Say with Large Language Models in Autonomous Vehicles | - | 0
How Does Artificial Intelligence Improve Human Decision-Making? Evidence from the AI-Powered Go Program | - | 0
From Large Language Models to Knowledge Graphs for Biomarker Discovery in Cancer | - | 0
Learning a Reward Function for User-Preferred Appliance Scheduling | Code | 0
Contextualized Policy Recovery: Modeling and Interpreting Medical Decisions with Adaptive Imitation Learning | - | 0
NeuroInspect: Interpretable Neuron-based Debugging Framework through Class-conditional Visualizations | Code | 0
Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models | - | 0
Page 208 of 493

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | - | Unverified