SOTAVerified

Decision Making

Papers

Showing 42514275 of 12311 papers

TitleStatusHype
GPT-4V(ision) Unsuitable for Clinical Care and Education: A Clinician-Evaluated Assessment0
Uncertainty Quantification in Neural-Network Based Pain Intensity Estimation0
Introducing an Improved Information-Theoretic Measure of Predictive Uncertainty0
Extrinsically-Focused Evaluation of Omissions in Medical Summarization0
Towards a Transportable Causal Network Model Based on Observational Healthcare Data0
Optimising Human-AI Collaboration by Learning Convincing Explanations0
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question AnsweringCode1
Real-Time Machine-Learning-Based Optimization Using Input Convex Long Short-Term Memory NetworkCode1
Decision-making under risk: when is utility maximization equivalent to risk minimization?0
Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization RegimeCode1
Enabling High-Level Machine Reasoning with Cognitive Neuro-Symbolic Systems0
Multi-agent Attacks for Black-box Social Recommendations0
Large Language Models for Robotics: A Survey0
An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making0
Explainability of Vision Transformers: A Comprehensive Review and New Perspectives0
The Multi-BMBY Mechanism: Proportionality-Preserving and Strategyproof Ownership Restructuring in Private Companies0
An advantage based policy transfer algorithm for reinforcement learning with measures of transferability0
TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System0
ChatGPT Exhibits Gender and Racial Biases in Acute Coronary Syndrome Management0
Business Policy Experiments using Fractional Factorial Designs: Consumer Retention on DoorDash0
Sum-max Submodular Bandits0
GRAM: An Interpretable Approach for Graph Anomaly Detection using Gradient Attention Maps0
Language Models can be Logical Solvers0
MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable UncertaintyCode1
Forte: An Interactive Visual Analytic Tool for Trust-Augmented Net Load Forecasting0
Show:102550
← PrevPage 171 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified