SOTAVerified

Decision Making

Papers

Showing 411420 of 12311 papers

TitleStatusHype
Self-Calibrating Conformal PredictionCode1
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive LossCode1
Entropy-Regularized Token-Level Policy Optimization for Language Agent ReinforcementCode1
Conformal Convolution and Monte Carlo Meta-learners for Predictive Inference of Individual Treatment EffectsCode1
Sym-Q: Adaptive Symbolic Regression via Sequential Decision-MakingCode1
Measuring Implicit Bias in Explicitly Unbiased Large Language ModelsCode1
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable SkillsCode1
Deep hybrid models: infer and plan in a dynamic worldCode1
LLM Voting: Human Choices and AI Collective Decision MakingCode1
Layered and Staged Monte Carlo Tree Search for SMT Strategy SynthesisCode1
Show:102550
← PrevPage 42 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified