SOTAVerified

Decision Making

Papers

Showing 701725 of 12311 papers

TitleStatusHype
Fairness in Credit Scoring: Assessment, Implementation and Profit ImplicationsCode1
Fairness in Ranking under UncertaintyCode1
Certified Reinforcement Learning with Logic GuidanceCode1
Fault-Tolerant Federated Reinforcement Learning with Theoretical GuaranteeCode1
A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed ImagesCode1
FedGCS: A Generative Framework for Efficient Client Selection in Federated Learning via Gradient-based OptimizationCode1
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced DatasetsCode1
Beyond Trivial Counterfactual Explanations with Diverse Valuable ExplanationsCode1
Bias in Multimodal AI: Testbed for Fair Automatic RecruitmentCode1
Beyond Greedy Search: Tracking by Multi-Agent Reinforcement Learning-based Beam SearchCode1
Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for SamplingCode1
Bidirectional Model-based Policy OptimizationCode1
Forecasting Future World Events with Neural NetworksCode1
Benchmarks for Deep Off-Policy EvaluationCode1
From Classification to Clinical Insights: Towards Analyzing and Reasoning About Mobile and Behavioral Health Data With Large Language ModelsCode1
From Parity to Preference-based Notions of Fairness in ClassificationCode1
From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision MakingCode1
AvalonBench: Evaluating LLMs Playing the Game of AvalonCode1
Benchmarking saliency methods for chest X-ray interpretationCode1
BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned ApproximationsCode1
Beyond calibration: estimating the grouping loss of modern neural networksCode1
GATSBI: Generative Agent-centric Spatio-temporal Object InteractionCode1
Generalising Discrete Action Spaces with Conditional Action TreesCode1
A Justice-Based Framework for the Analysis of Algorithmic Fairness-Utility Trade-OffsCode1
Bidirectional Representation Learning from Transformers using Multimodal Electronic Health Record Data to Predict DepressionCode1
Show:102550
← PrevPage 29 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified