SOTAVerified

Decision Making

Papers

Showing 426450 of 12311 papers

TitleStatusHype
Effective Reinforcement Learning through Evolutionary Surrogate-Assisted PrescriptionCode1
CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired TransformerCode1
GLAMOUR: Graph Learning over Macromolecule RepresentationsCode1
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and ClassificationCode1
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge SummariesCode1
ChatGPT-powered Conversational Drug Editing Using Retrieval and Domain FeedbackCode1
Emergent Linear Representations in World Models of Self-Supervised Sequence ModelsCode1
Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language ModelsCode1
EMT: Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine ReadingCode1
ChessGPT: Bridging Policy Learning and Language ModelingCode1
Engineering flexible machine learning systems by traversing functionally-invariant pathsCode1
CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principlesCode1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in CoqCode1
A Divergence Minimization Perspective on Imitation Learning MethodsCode1
Ensemble Quantile Networks: Uncertainty-Aware Reinforcement Learning with Applications in Autonomous DrivingCode1
Entropy-Regularized Token-Level Policy Optimization for Language Agent ReinforcementCode1
Epidemic Modeling with Generative AgentsCode1
Aequitas: A Bias and Fairness Audit ToolkitCode1
Ergodicity-breaking reveals time optimal decision making in humansCode1
Certified Reinforcement Learning with Logic GuidanceCode1
CFGPT: Chinese Financial Assistant with Large Language ModelCode1
Causal thinking for decision making on Electronic Health Records: why and howCode1
Explainable AI for computational pathology identifies model limitations and tissue biomarkersCode1
Explainable Deep Learning for Tumor Dynamic Modeling and Overall Survival Prediction using Neural-ODECode1
CELLO: Causal Evaluation of Large Vision-Language ModelsCode1
Show:102550
← PrevPage 18 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified