SOTAVerified

Decision Making

Papers

Showing 451-475 of 12311 papers

Title | Status | Hype
Explaining Autonomous Driving Actions with Visual Question Answering | Code | 1
Explaining generative diffusion models via visual analysis for interpretable decision-making process | Code | 1
Certified Reinforcement Learning with Logic Guidance | Code | 1
Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading | Code | 1
Failure Detection in Medical Image Classification: A Reality Check and Benchmarking Testbed | Code | 1
Fair and Optimal Classification via Post-Processing | Code | 1
CELLO: Causal Evaluation of Large Vision-Language Models | Code | 1
Fairness Constraints: Mechanisms for Fair Classification | Code | 1
Fairness in Ranking under Uncertainty | Code | 1
Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning | Code | 1
Faithfully Explainable Recommendation via Neural Logic Reasoning | Code | 1
Fast Interpretable Greedy-Tree Sums | Code | 1
CertRL: Formalizing Convergence Proofs for Value and Policy Iteration in Coq | Code | 1
FedGCS: A Generative Framework for Efficient Client Selection in Federated Learning via Gradient-based Optimization | Code | 1
Causal Discovery with Language Models as Imperfect Experts | Code | 1
Causal thinking for decision making on Electronic Health Records: why and how | Code | 1
CFGPT: Chinese Financial Assistant with Large Language Model | Code | 1
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs | Code | 1
Flexible Job Shop Scheduling via Dual Attention Network Based Reinforcement Learning | Code | 1
FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone Navigation | Code | 1
Can Learned Optimization Make Reinforcement Learning Less Difficult? | Code | 1
Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements | Code | 1
From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language Models | Code | 1
From Parity to Preference-based Notions of Fairness in Classification | Code | 1
CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified