SOTAVerified

Decision Making

Papers

Showing 19011925 of 12311 papers

TitleStatusHype
Scalable Decision-Making in Stochastic Environments through Learned Temporal AbstractionCode0
Personalized Causal Graph Reasoning for LLMs: A Case Study on Dietary Recommendations0
Llamarine: Open-source Maritime Industry-specific Large Language Model0
MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language ModelsCode0
Advanced Deep Learning Techniques for Analyzing Earnings Call Transcripts: Methodologies and Applications0
Large Language Model Strategic Reasoning Evaluation through Behavioral Game Theory0
Minds on the Move: Decoding Trajectory Prediction in Autonomous Driving with Cognitive Insights0
Non-Cooperative Games with Uncertainty0
Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application0
Efficient Risk-sensitive Planning via Entropic Risk Measures0
Can a calibration metric be both testable and actionable?Code0
Program Synthesis Dialog Agents for Interactive Decision-MakingCode0
Data-Efficient Multi-Agent Spatial Planning with LLMs0
Voting or Consensus? Decision-Making in Multi-Agent DebateCode0
Learning Ensembles of Interpretable Simple Structure0
A Causal Lens for Evaluating Faithfulness Metrics0
WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management StrategiesCode0
Certified Decisions0
An Ensemble Framework for Probabilistic Short-Term Load Forecasting Based on BiTCN and Deep Attention NetworksCode0
Heterogeneous Decision Making in Mixed Traffic: Uncertainty-aware Planning and Bounded Rationality0
Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent0
Global-Decision-Focused Neural ODEs for Proactive Grid Resilience Management0
A Collection of Innovations in Medical AI for patient records in 20240
Generalized Decision Focused Learning under Imprecise Uncertainty--Theoretical Study0
Assessing Large Language Models in Agentic Multilingual National Bias0
Show:102550
← PrevPage 77 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified