SOTAVerified

Decision Making

Papers

Showing 501525 of 12311 papers

TitleStatusHype
Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame SimulationsCode1
Hybrid and Automated Machine Learning Approaches for Oil Fields Development: the Case Study of Volve Field, North SeaCode1
IdentiFace : A VGG Based Multimodal Facial Biometric SystemCode1
Rejecting Hallucinated State Targets during PlanningCode1
Aequitas: A Bias and Fairness Audit ToolkitCode1
ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical ImagesCode1
GLAMOUR: Graph Learning over Macromolecule RepresentationsCode1
Improving Recommendation Fairness via Data AugmentationCode1
Goal-directed graph construction using reinforcement learningCode1
In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-ThoughtCode1
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain FeedbackCode1
ChessGPT: Bridging Policy Learning and Language ModelingCode1
CFGPT: Chinese Financial Assistant with Large Language ModelCode1
Certified Reinforcement Learning with Logic GuidanceCode1
CELLO: Causal Evaluation of Large Vision-Language ModelsCode1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language ModelsCode1
ComBiNet: Compact Convolutional Bayesian Neural Network for Image SegmentationCode1
Counterfactual Explanations in Sequential Decision Making Under UncertaintyCode1
DiffLoad: Uncertainty Quantification in Electrical Load Forecasting with the Diffusion ModelCode1
Can Learned Optimization Make Reinforcement Learning Less Difficult?Code1
Adapting and Evaluating Influence-Estimation Methods for Gradient-Boosted Decision TreesCode1
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMsCode1
Can Increasing Input Dimensionality Improve Deep Reinforcement Learning?Code1
Can language agents be alternatives to PPO? A Preliminary Empirical Study On OpenAI GymCode1
Show:102550
← PrevPage 21 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified