SOTAVerified

Decision Making

Papers

Showing 826850 of 12311 papers

TitleStatusHype
Dissecting the Impact of Model Misspecification in Data-Driven Optimization0
Shaping Laser Pulses with Reinforcement Learning0
Semi-Parametric Batched Global Multi-Armed Bandits with Covariates0
Interacting with AI Reasoning Models: Harnessing "Thoughts" for AI-Driven Software Engineering0
Shifting Power: Leveraging LLMs to Simulate Human Aversion in ABMs of Bilateral Financial Exchanges, A bond market study0
What Makes a Good Diffusion Planner for Decision Making?Code2
MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language ModelsCode0
Ro-To-Go! Robust Reactive Control with Signal Temporal Logic0
Digital Player: Evaluating Large Language Models based Human-like Agent in GamesCode2
Adaptive Reinforcement Learning for State Avoidance in Discrete Event Systems0
Llamarine: Open-source Maritime Industry-specific Large Language Model0
Personalized Causal Graph Reasoning for LLMs: A Case Study on Dietary Recommendations0
Investigating the Relationship Between Debiasing and Artifact Removal using Saliency Maps0
A Deep User Interface for Exploring LLaMa0
Scalable Decision-Making in Stochastic Environments through Learned Temporal AbstractionCode0
Advanced Deep Learning Techniques for Analyzing Earnings Call Transcripts: Methodologies and Applications0
Non-Cooperative Games with Uncertainty0
Efficient Risk-sensitive Planning via Entropic Risk Measures0
Large Language Model Strategic Reasoning Evaluation through Behavioral Game Theory0
Deep Reinforcement Learning based Autonomous Decision-Making for Cooperative UAVs: A Search and Rescue Real World Application0
Minds on the Move: Decoding Trajectory Prediction in Autonomous Driving with Cognitive Insights0
CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired TransformerCode1
Can a calibration metric be both testable and actionable?Code0
WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management StrategiesCode0
Program Synthesis Dialog Agents for Interactive Decision-MakingCode0
Show:102550
← PrevPage 34 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified