SOTAVerified

Decision Making

Papers

Showing 27012750 of 12311 papers

TitleStatusHype
Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage DomainCode0
Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning: A Systematic Review0
Data Augmentation in Earth Observation: A Diffusion Model Approach0
Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context0
Can Language Models Serve as Text-Based World Simulators?0
Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer VisionCode0
Numerical solution of a PDE arising from prediction with expert adviceCode0
Observation Denoising in CYRUS Soccer Simulation 2D Team For RoboCup 2024Code0
Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits0
Cross Language Soccer Framework: An Open Source Framework for the RoboCup 2D Soccer SimulationCode0
BOSC: A toolbox for aerial imagery mappingCode0
G-Transformer: Counterfactual Outcome Prediction under Dynamic and Time-varying Treatment Regimes0
Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification0
Advancing Histopathology-Based Breast Cancer Diagnosis: Insights into Multi-Modality and Explainability0
Toward Real-Time Digital Twins of EM Environments: Computational Benchmark of Ray Launching SoftwareCode0
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals0
Predictive Dynamic FusionCode2
Tangent differential privacy0
GNNAnatomy: Rethinking Model-Level Explanations for Graph Neural Networks0
Explainability and Hate Speech: Structured Explanations Make Social Media Moderators FasterCode0
Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks0
Views about ChatGPT: Are human decision making and human learning necessary?0
Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing AgentsCode0
Leveraging automatic strategy discovery to teach people how to select better projectsCode0
Memorization in deep learning: A survey0
Robust Prediction Model for Multidimensional and Unbalanced Datasets0
MagiNet: Mask-Aware Graph Imputation Network for Incomplete Traffic Data0
Ensembling Portfolio Strategies for Long-Term Investments: A Distribution-Free Preference Framework for Decision-Making and Algorithms0
Open Grounded Planning: Challenges and Benchmark ConstructionCode1
The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games0
Efficient Exploration of the Rashomon Set of Rule Set ModelsCode0
Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models0
Towards Understanding the Influence of Training Samples on Explanations0
Sound Heuristic Search Value Iteration for Undiscounted POMDPs with Reachability ObjectivesCode0
Tensor Polynomial Additive Model0
Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks: An Extended Investigation0
Simplification of Risk Averse POMDPs with Performance Guarantees0
RATT: A Thought Structure for Coherent and Correct LLM ReasoningCode1
Building Socially-Equitable Public ModelsCode0
Enabling Decision-Making with the Modified Causal Forest: Policy Trees for Treatment Assignment0
Disentangled Representation via Variational AutoEncoder for Continuous Treatment Effect Estimation0
Rectifying Reinforcement Learning for Reward Matching0
Label-wise Aleatoric and Epistemic Uncertainty QuantificationCode0
Improving Generalization in Aerial and Terrestrial Mobile Robots Control Through Delayed Policy Learning0
Why Would You Suggest That? Human Trust in Language Model Responses0
XRec: Large Language Models for Explainable RecommendationCode2
How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?0
Enhancing predictive imaging biomarker discovery through treatment effect analysisCode0
Large Language Model-Enabled Multi-Agent Manufacturing Systems0
Uncovering dynamical equations of stochastic decision models using data-driven SINDy algorithm0
Show:102550
← PrevPage 55 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified