SOTAVerified

Decision Making

Papers

Showing 17011725 of 12311 papers

TitleStatusHype
Calibrated Decision-Making through LLM-Assisted Retrieval0
Do LLM Personas Dream of Bull Markets? Comparing Human and AI Investment Strategies Through the Lens of the Five-Factor Model0
Moral Agency in Silico: Exploring Free Will in Large Language Models0
Project MPG: towards a generalized performance benchmark for LLM capabilities0
Sabotage Evaluations for Frontier Models0
Bayesian Regression for Predicting Subscription to Bank Term Deposits in Direct Marketing Campaigns0
Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games0
Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness0
Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments0
Bridging the Gap between Expert and Language Models: Concept-guided Chess Commentary Generation and Evaluation0
Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric AssessmentsCode0
Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering0
Voting with Random Proposers: Two Rounds Suffice0
The green transition of firms: The role of evolutionary competition, adjustment costs, transition risk, and green technology progress0
Deconfounding Time Series Forecasting0
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning0
Improving Decision SparsityCode0
Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market SimulationsCode0
Toward Conditional Distribution Calibration in Survival PredictionCode1
Language Models And A Second Opinion Use Case: The Pocket Professional0
Predicting Mortality and Functional Status Scores of Traumatic Brain Injury Patients using Supervised Machine Learning0
Quantifying Risk Propensities of Large Language Models: Ethical Focus and Bias Detection through Role-Play0
KisanQRS: A Deep Learning-based Automated Query-Response System for Agricultural Decision-Making0
Causal Abstraction in Model Interpretability: A Compact Survey0
LLM-Consensus: Multi-Agent Debate for Visual Misinformation Detection0
Show:102550
← PrevPage 69 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified