SOTAVerified

Decision Making

Papers

Showing 28512875 of 12311 papers

TitleStatusHype
β-calibration of Language Model Confidence Scores for Generative QA0
Optimizing Estimators of Squared Calibration Errors in Classification0
Detecting Bias and Enhancing Diagnostic Accuracy in Large Language Models for Healthcare0
Crafting desirable climate trajectories with RL explored socio-environmental simulationsCode0
The Moral Turing Test: Evaluating Human-LLM Alignment in Moral Decision-Making0
Modeling chaotic Lorenz ODE System using Scientific Machine Learning0
Reproducing and Extending Experiments in Behavioral Strategy with Large Language Models0
DisasterQA: A Benchmark for Assessing the performance of LLMs in Disaster Response0
Generating Origin-Destination Matrices in Neural Spatial Interaction ModelsCode0
Towards an Operational Responsible AI Framework for Learning Analytics in Higher Education0
Navigating Inflation in Ghana: How Can Machine Learning Enhance Economic Stability and Growth Strategies0
Context-Aware Command Understanding for Tabletop Scenarios0
HumVI: A Multilingual Dataset for Detecting Violent Incidents Impacting Humanitarian AidCode0
Tree-Based Leakage Inspection and Control in Concept Bottleneck ModelsCode0
Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile RobotsCode0
On the Modeling Capabilities of Large Language Models for Sequential Decision Making0
Biased AI can Influence Political Decision-Making0
Better than Your Teacher: LLM Agents that learn from Privileged AI Feedback0
Driving with Regulation: Interpretable Decision-Making for Autonomous Vehicles with Retrieval-Augmented Reasoning via LLM0
Functional Clustering of Discount Functions for Behavioral Investor Profiling0
Deep learning-based Visual Measurement Extraction within an Adaptive Digital Twin Framework from Limited Data Using Transfer Learning0
Assouad, Fano, and Le Cam with Interaction: A Unifying Lower Bound Framework and Characterization for Bandit Learnability0
ResTNet: Defense against Adversarial Policies via Transformer in Computer Go0
Intuitions of Compromise: Utilitarianism vs. ContractualismCode0
ReasoningRank: Teaching Student Models to Rank through Reasoning-Based Knowledge Distillation0
Show:102550
← PrevPage 115 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified