SOTAVerified

Decision Making

Papers

Showing 27012725 of 12311 papers

TitleStatusHype
Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context0
Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning: A Systematic Review0
Data Augmentation in Earth Observation: A Diffusion Model Approach0
Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage DomainCode0
Can Language Models Serve as Text-Based World Simulators?0
Numerical solution of a PDE arising from prediction with expert adviceCode0
Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer VisionCode0
Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits0
Observation Denoising in CYRUS Soccer Simulation 2D Team For RoboCup 2024Code0
Cross Language Soccer Framework: An Open Source Framework for the RoboCup 2D Soccer SimulationCode0
BOSC: A toolbox for aerial imagery mappingCode0
G-Transformer: Counterfactual Outcome Prediction under Dynamic and Time-varying Treatment Regimes0
Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification0
Advancing Histopathology-Based Breast Cancer Diagnosis: Insights into Multi-Modality and Explainability0
Toward Real-Time Digital Twins of EM Environments: Computational Benchmark of Ray Launching SoftwareCode0
Predictive Dynamic FusionCode2
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals0
Tangent differential privacy0
GNNAnatomy: Rethinking Model-Level Explanations for Graph Neural Networks0
Views about ChatGPT: Are human decision making and human learning necessary?0
Explainability and Hate Speech: Structured Explanations Make Social Media Moderators FasterCode0
Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks0
Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing AgentsCode0
Memorization in deep learning: A survey0
Leveraging automatic strategy discovery to teach people how to select better projectsCode0
Show:102550
← PrevPage 109 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified