SOTAVerified

Decision Making

Papers

Showing 651675 of 12311 papers

TitleStatusHype
Diverse and Admissible Trajectory Prediction through Multimodal Context UnderstandingCode1
Bayesian Optimization of Risk MeasuresCode1
Distributional GFlowNets with Quantile FlowsCode1
Bayesian Safety Validation for Failure Probability Estimation of Black-Box SystemsCode1
Dissecting and Mitigating Diffusion Bias via Mechanistic InterpretabilityCode1
Benchmarking Data Science AgentsCode1
Divide and Conquer: Answering Questions with Object Factorization and Compositional ReasoningCode1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared KnowledgeCode1
DisastIR: A Comprehensive Information Retrieval Benchmark for Disaster ManagementCode1
Benchmarking saliency methods for chest X-ray interpretationCode1
Benchmarks for Deep Off-Policy EvaluationCode1
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned ApproximationsCode1
An Objective Metric for Explainable AI: How and Why to Estimate the Degree of ExplainabilityCode1
Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine ReadingCode1
An Introduction to Deep Reinforcement LearningCode1
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam SearchCode1
Beyond calibration: estimating the grouping loss of modern neural networksCode1
From Parity to Preference-based Notions of Fairness in ClassificationCode1
Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for SamplingCode1
A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort studyCode1
DIME: Fine-grained Interpretations of Multimodal Models via Disentangled Local ExplanationsCode1
Beyond Trivial Counterfactual Explanations with Diverse Valuable ExplanationsCode1
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced DatasetsCode1
Distributional Counterfactual Explanations With Optimal TransportCode1
Show:102550
← PrevPage 27 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified