SOTAVerified

Decision Making

Papers

Showing 651700 of 12311 papers

TitleStatusHype
Do graph neural networks learn traditional jet substructure?Code1
Bayesian Safety Validation for Failure Probability Estimation of Black-Box SystemsCode1
Does the Markov Decision Process Fit the Data: Testing for the Markov Property in Sequential Decision MakingCode1
FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone NavigationCode1
Domain Generalization via Rationale InvarianceCode1
Benchmarking LLMs for Political Science: A United Nations PerspectiveCode1
Benchmarking Data Science AgentsCode1
DocSegTr: An Instance-Level End-to-End Document Image Segmentation TransformerCode1
Benchmarking saliency methods for chest X-ray interpretationCode1
Benchmarks for Deep Off-Policy EvaluationCode1
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned ApproximationsCode1
From point forecasts to multivariate probabilistic forecasts: The Schaake shuffle for day-ahead electricity price forecastingCode1
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World ModellingCode1
DORA: Exploring Outlier Representations in Deep Neural NetworksCode1
EDGE COVID-19: A Web Platform to generate submission-ready genomes for SARS-CoV-2 sequencing effortsCode1
From Attribution Maps to Human-Understandable Explanations through Concept Relevance PropagationCode1
Diverse and Admissible Trajectory Prediction through Multimodal Context UnderstandingCode1
Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for SamplingCode1
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam SearchCode1
A novel interpretable machine learning system to generate clinical risk scores: An application for predicting early mortality or unplanned readmission in a retrospective cohort studyCode1
Beyond Trivial Counterfactual Explanations with Diverse Valuable ExplanationsCode1
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced DatasetsCode1
Bias in Multimodal AI: Testbed for Fair Automatic RecruitmentCode1
Generalized Linear Bandits with Local Differential PrivacyCode1
Bidirectional Model-based Policy OptimizationCode1
Bidirectional Representation Learning from Transformers using Multimodal Electronic Health Record Data to Predict DepressionCode1
Generating Hierarchical Explanations on Text Classification via Feature Interaction DetectionCode1
Generating Synthetic Mixed-type Longitudinal Electronic Health Records for Artificial Intelligent ApplicationsCode1
Divide and Conquer: Answering Questions with Object Factorization and Compositional ReasoningCode1
Distributive Justice as the Foundational Premise of Fair ML: Unification, Extension, and Interpretation of Group Fairness MetricsCode1
GMAI-VL-R1: Harnessing Reinforcement Learning for Multimodal Medical ReasoningCode1
GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature FieldsCode1
An Objective Metric for Explainable AI: How and Why to Estimate the Degree of ExplainabilityCode1
Diverse and Admissible Trajectory Forecasting through Multimodal Context UnderstandingCode1
Dissecting and Mitigating Diffusion Bias via Mechanistic InterpretabilityCode1
An Introduction to Deep Reinforcement LearningCode1
Distributional GFlowNets with Quantile FlowsCode1
Brain Tumor Segmentation and Radiomics Survival Prediction: Contribution to the BRATS 2017 ChallengeCode1
Discriminative Particle Filter Reinforcement Learning for Complex Partial ObservationsCode1
BoWFire: Detection of Fire in Still Images by Integrating Pixel Color and Texture AnalysisCode1
Group-Aware Coordination Graph for Multi-Agent Reinforcement LearningCode1
Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop FeedbackCode1
Bridging POMDPs and Bayesian decision making for robust maintenance planning under model uncertainty: An application to railway systemsCode1
G-Transformer for Conditional Average Potential Outcome Estimation over TimeCode1
Bundle Recommendation with Graph Convolutional NetworksCode1
BuildingView: Constructing Urban Building Exteriors Databases with Street View Imagery and Multimodal Large Language ModeCode1
Distributional Counterfactual Explanations With Optimal TransportCode1
Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge GraphsCode1
Examining Inter-Consistency of Large Language Models Collaboration: An In-depth Analysis via DebateCode1
DIME: Fine-grained Interpretations of Multimodal Models via Disentangled Local ExplanationsCode1
Show:102550
← PrevPage 14 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified