SOTAVerified

Decision Making

Papers

Showing 2921–2930 of 12311 papers

Title | Status | Hype
Conversational Disease Diagnosis via External Planner-Controlled Large Language Models | Code | 0
Counterpart Fairness -- Addressing Systematic between-group Differences in Fairness Evaluation | Code | 0
Convex optimization for actionable & plausible counterfactual explanations | Code | 0
A knowledge-driven vowel-based approach of depression classification from speech using data augmentation | Code | 0
Optimal decision making in robotic assembly and other trial-and-error tasks | Code | 0
Explainability of Deep Neural Networks for Brain Tumor Detection | Code | 0
Information Gathering in Decentralized POMDPs by Policy Graph Improvement | Code | 0
Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control | Code | 0
CookDial: A dataset for task-oriented dialogs grounded in procedural documents | Code | 0
Piecing Together Clues: A Benchmark for Evaluating the Detective Skills of Large Language Models | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified