SOTAVerified

Decision Making

Papers

Showing 41514175 of 12311 papers

TitleStatusHype
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web0
AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities0
Efficient Baseline for Quantitative Precipitation Forecasting in Weather4cast 20230
Dynamic interactive group decision making method on two-dimensional language0
Game Projection and Robustness for Game-Theoretic Autonomous Driving0
Learning to Simulate: Generative Metamodeling via Quantile Regression0
Enhancing Post-Hoc Explanation Benchmark Reliability for Image Classification0
Two-Step Reinforcement Learning for Multistage Strategy Card Game0
LLM-State: Open World State Representation for Long-horizon Task Planning with Large Language Model0
Learning-driven Zero Trust in Distributed Computing Continuum Systems0
Mostly Beneficial Clustering: Aggregating Data for Operational Decision Making0
Joint network for specular highlight detection and adversarial generation of specular-free images trained with polarimetric dataCode0
Infection-responsivity of Commercial Dressings Through Halochromic Drop-casting0
On the Robustness of Decision-Focused LearningCode0
Model-free Test Time Adaptation for Out-Of-Distribution Detection0
Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based ExplanationsCode1
The Adoption and Efficacy of Large Language Models: Evidence From Consumer Complaints in the Financial Industry0
Towards Energysheds: A Technical Definition and Cooperative Framework for Future Power System Operations0
Automated discovery of trade-off between utility, privacy and fairness in machine learning models0
RetouchUAA: Unconstrained Adversarial Attack via Image Retouching0
Interactive Autonomous Navigation with Internal State Inference and Interactivity Estimation0
A new fuzzy multi-attribute group decision-making method based on TOPSIS and optimization models0
Multi-Agent Reinforcement Learning for Power Control in Wireless Networks via Adaptive Graphs0
Injecting linguistic knowledge into BERT for Dialogue State Tracking0
Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language ModelsCode1
Show:102550
← PrevPage 167 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified