SOTAVerified

Decision Making

Papers

Showing 1052610550 of 12311 papers

TitleStatusHype
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?Code0
Counterfactual Generative Models for Time-Varying TreatmentsCode0
Achieving Long-Term Fairness in Sequential Decision MakingCode0
Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon GameCode0
Reasoning about Counterfactuals to Improve Human Inverse Reinforcement LearningCode0
Better Batch for Deep Probabilistic Time Series ForecastingCode0
LoRA Users Beware: A Few Spurious Tokens Can Manipulate Your Finetuned ModelCode0
Amortized Bayesian Decision Making for simulation-based modelsCode0
Functional Linear Regression of Cumulative Distribution FunctionsCode0
Distance Weighted Supervised Learning for Offline Interaction DataCode0
A Risk-Sensitive Approach to Policy OptimizationCode0
Loss-Aversively Fair ClassificationCode0
Fundamental Limits for Sensor-Based Robot ControlCode0
Loss Bounds for Approximate Influence-Based AbstractionCode0
Reinforced Cross-modal Alignment for Radiology Report GenerationCode0
Counterfactual Fairness by Combining Factual and Counterfactual PredictionsCode0
Don't Throw it Away! The Utility of Unlabeled Data in Fair Decision MakingCode0
Interpretable Outcome Prediction with Sparse Bayesian Neural Networks in Intensive CareCode0
Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic CollaborationCode0
Do Performance Aspirations Matter for Guiding Software Configuration Tuning?Code0
DoRA: Domain-Based Self-Supervised Learning Framework for Low-Resource Real Estate AppraisalCode0
Disentangled behavioural representationsCode0
Challenging common interpretability assumptions in feature attribution explanationsCode0
Do the Machine Learning Models on a Crowd Sourced Platform Exhibit Bias? An Empirical Study on Model FairnessCode0
Discrete-Time Distribution Steering using Monte Carlo Tree SearchCode0
Show:102550
← PrevPage 422 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified