SOTAVerified

Decision Making

Papers

Showing 36013650 of 12311 papers

TitleStatusHype
Satisficing Exploration in Bandit Optimization0
Decision-Making Behavior Evaluation Framework for LLMs under Uncertain Context0
Data Augmentation in Earth Observation: A Diffusion Model Approach0
Long-Term Fairness Inquiries and Pursuits in Machine Learning: A Survey of Notions, Methods, and Challenges0
Can Language Models Serve as Text-Based World Simulators?0
Risk Sensitivity in Markov Games and Multi-Agent Reinforcement Learning: A Systematic Review0
Observation Denoising in CYRUS Soccer Simulation 2D Team For RoboCup 2024Code0
Which Backbone to Use: A Resource-efficient Domain Specific Comparison for Computer VisionCode0
Data-Driven Upper Confidence Bounds with Near-Optimal Regret for Heavy-Tailed Bandits0
Cross Language Soccer Framework: An Open Source Framework for the RoboCup 2D Soccer SimulationCode0
BOSC: A toolbox for aerial imagery mappingCode0
Numerical solution of a PDE arising from prediction with expert adviceCode0
Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification0
G-Transformer: Counterfactual Outcome Prediction under Dynamic and Time-varying Treatment Regimes0
SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals0
Advancing Histopathology-Based Breast Cancer Diagnosis: Insights into Multi-Modality and Explainability0
Toward Real-Time Digital Twins of EM Environments: Computational Benchmark of Ray Launching SoftwareCode0
Explainability and Hate Speech: Structured Explanations Make Social Media Moderators FasterCode0
Views about ChatGPT: Are human decision making and human learning necessary?0
Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing AgentsCode0
Tangent differential privacy0
Memorization in deep learning: A survey0
Regularized KL-Divergence for Well-Defined Function-Space Variational Inference in Bayesian neural networks0
GNNAnatomy: Rethinking Model-Level Explanations for Graph Neural Networks0
Leveraging automatic strategy discovery to teach people how to select better projectsCode0
MagiNet: Mask-Aware Graph Imputation Network for Incomplete Traffic Data0
Ensembling Portfolio Strategies for Long-Term Investments: A Distribution-Free Preference Framework for Decision-Making and Algorithms0
Towards Understanding the Influence of Training Samples on Explanations0
Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks: An Extended Investigation0
Tensor Polynomial Additive Model0
Simplification of Risk Averse POMDPs with Performance Guarantees0
The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games0
Robust Prediction Model for Multidimensional and Unbalanced Datasets0
Sound Heuristic Search Value Iteration for Undiscounted POMDPs with Reachability ObjectivesCode0
Unveiling Selection Biases: Exploring Order and Token Sensitivity in Large Language Models0
Efficient Exploration of the Rashomon Set of Rule Set ModelsCode0
Enhancing predictive imaging biomarker discovery through treatment effect analysisCode0
Rectifying Reinforcement Learning for Reward Matching0
Improving Generalization in Aerial and Terrestrial Mobile Robots Control Through Delayed Policy Learning0
How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?0
Disentangled Representation via Variational AutoEncoder for Continuous Treatment Effect Estimation0
Large Language Model-Enabled Multi-Agent Manufacturing Systems0
Enabling Decision-Making with the Modified Causal Forest: Policy Trees for Treatment Assignment0
Label-wise Aleatoric and Epistemic Uncertainty QuantificationCode0
Why Would You Suggest That? Human Trust in Language Model Responses0
Building Socially-Equitable Public ModelsCode0
Automatic Input Feature Relevance via Spectral Neural NetworksCode0
AI as Decision-Maker: Ethics and Risk Preferences of LLMs0
Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs0
Augmented Commonsense Knowledge for Remote Object GroundingCode0
Show:102550
← PrevPage 73 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified