SOTAVerified

Decision Making

Papers

Showing 3251–3275 of 12311 papers

| Title | Status | Hype |
| --- | --- | --- |
| Assessing Robustness of Machine Learning Models using Covariate Perturbations | | 0 |
| The Mismeasure of Man and Models: Evaluating Allocational Harms in Large Language Models | | 0 |
| Metareasoning in uncertain environments: a meta-BAMDP framework | | 0 |
| Deep Learning in Medical Image Classification from MRI-based Brain Tumor Images | | 0 |
| Generalisation of Total Uncertainty in AI: A Theoretical Study | | 0 |
| Reinforcement Learning applied to Insurance Portfolio Pursuit | Code | 0 |
| Improving Machine Learning Based Sepsis Diagnosis Using Heart Rate Variability | | 0 |
| Load Balancing in Federated Learning | | 0 |
| Cost-Effective Hallucination Detection for LLMs | | 0 |
| Pathology Foundation Models | | 0 |
| Who should I trust? A Visual Analytics Approach for Comparing Net Load Forecasting Models | | 0 |
| Voxel Scene Graph for Intracranial Hemorrhage | Code | 0 |
| KemenkeuGPT: Leveraging a Large Language Model on Indonesia's Government Financial Data and Regulations to Enhance Decision Making | | 0 |
| Interpreting and learning voice commands with a Large Language Model for a robot system | | 0 |
| Enhancing Agricultural Machinery Management through Advanced LLM Integration | | 0 |
| Deduction Game Framework and Information Set Entropy Search | | 0 |
| Towards an Integrated Performance Framework for Fire Science and Management Workflows | | 0 |
| Extending choice assessments to choice functions: An algorithm for computing the natural extension | | 0 |
| Powerful A/B-Testing Metrics and Where to Find Them | | 0 |
| How to Choose a Reinforcement-Learning Algorithm | | 0 |
| From Feature Importance to Natural Language Explanations Using LLMs with RAG | Code | 0 |
| DiffusionCounterfactuals: Inferring High-dimensional Counterfactuals with Guidance of Causal Representations | | 0 |
| Legal Minds, Algorithmic Decisions: How LLMs Apply Constitutional Principles in Complex Scenarios | | 0 |
| Time series forecasting with high stakes: A field study of the air cargo industry | | 0 |
| Mitigating Farmland Biodiversity Loss: A Bio-Economic Model of Land Consolidation and Pesticide Use | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified |