SOTAVerified

Decision Making

Papers

Showing 42764300 of 12311 papers

TitleStatusHype
AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments0
AgentCF: Collaborative Learning with Autonomous Language Agents for Recommender Systems0
Efficient Epistemic Uncertainty Estimation in Regression Ensemble Models Using Pairwise-Distance Estimators0
CausalCity: Complex Simulations with Agency for Causal Discovery and Reasoning0
A Pilot Evaluation of ChatGPT and DALL-E 2 on Decision Making and Spatial Reasoning0
Accurate melting point prediction through autonomous physics-informed learning0
Causal Bayesian Optimization0
Causal Bandits: Online Decision-Making in Endogenous Settings0
Resilient Supplier Selection in Logistics 4.0 with Heterogeneous Information0
ESMC: Entire Space Multi-Task Model for Post-Click Conversion Rate via Parameter Constraint0
Establishment and Solution of a Multi-Stage Decision Model Based on Hypothesis Testing and Dynamic Programming Algorithm0
Estimating Link Flows in Road Networks with Synthetic Trajectory Data Generation: Reinforcement Learning-based Approaches0
Causal Abstraction in Model Interpretability: A Compact Survey0
Active Preference Inference using Language Models and Probabilistic Reasoning0
CatMemo at the FinLLM Challenge Task: Fine-Tuning Large Language Models using Data Fusion in Financial Applications0
Category Theoretic Analysis of Photon-based Decision Making0
Equal Opportunity and Affirmative Action via Counterfactual Predictions0
A Personalized Data-to-Text Support Tool for Cancer Patients0
CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans0
A Parametric Top-View Representation of Complex Road Scenes0
Catastrophe, Compounding & Consistency in Choice0
Agent-Based Simulations of Online Political Discussions: A Case Study on Elections in Germany0
Equalizing Recourse across Groups0
Agent-Based Model: Simulating a Virus Expansion Based on the Acceptance of Containment Measures0
Epidemiological data challenges: planning for a more robust future through data standards0
Show:102550
← PrevPage 172 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified