SOTAVerified

Decision Making

Papers

Showing 41514200 of 12311 papers

TitleStatusHype
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web0
AI in Pharma for Personalized Sequential Decision-Making: Methods, Applications and Opportunities0
Efficient Baseline for Quantitative Precipitation Forecasting in Weather4cast 20230
Dynamic interactive group decision making method on two-dimensional language0
Game Projection and Robustness for Game-Theoretic Autonomous Driving0
Enhancing Post-Hoc Explanation Benchmark Reliability for Image Classification0
LLM-State: Open World State Representation for Long-horizon Task Planning with Large Language Model0
Mostly Beneficial Clustering: Aggregating Data for Operational Decision Making0
Two-Step Reinforcement Learning for Multistage Strategy Card Game0
Learning to Simulate: Generative Metamodeling via Quantile Regression0
Learning-driven Zero Trust in Distributed Computing Continuum Systems0
Joint network for specular highlight detection and adversarial generation of specular-free images trained with polarimetric dataCode0
Infection-responsivity of Commercial Dressings Through Halochromic Drop-casting0
On the Robustness of Decision-Focused LearningCode0
Model-free Test Time Adaptation for Out-Of-Distribution Detection0
The Adoption and Efficacy of Large Language Models: Evidence From Consumer Complaints in the Financial Industry0
Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based ExplanationsCode1
Towards Energysheds: A Technical Definition and Cooperative Framework for Future Power System Operations0
Automated discovery of trade-off between utility, privacy and fairness in machine learning models0
RetouchUAA: Unconstrained Adversarial Attack via Image Retouching0
A new fuzzy multi-attribute group decision-making method based on TOPSIS and optimization models0
Multi-Agent Reinforcement Learning for Power Control in Wireless Networks via Adaptive Graphs0
Utilizing Explainability Techniques for Reinforcement Learning Model AssuranceCode1
Interactive Autonomous Navigation with Internal State Inference and Interactivity Estimation0
Injecting linguistic knowledge into BERT for Dialogue State Tracking0
Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language ModelsCode1
AI-driven E-Liability Knowledge Graphs: A Comprehensive Framework for Supply Chain Carbon Accounting and Emissions Liability Management0
Benchmarking Large Language Model Volatility0
Decision Tree Psychological Risk Assessment in Currency Trading0
TORE: Token Recycling in Vision Transformers for Efficient Active Visual ExplorationCode0
Having Second Thoughts? Let's hear it0
Task adaption by biologically inspired stochastic comodulation0
Exploring Causal Learning through Graph Neural Networks: An In-depth Review0
Evaluating Large Language Models through Gender and Racial Stereotypes0
Thompson sampling for zero-inflated count outcomes with an application to the Drink Less mobile health study0
VSViG: Real-time Video-based Seizure Detection via Skeleton-based Spatiotemporal ViGCode1
History Filtering in Imperfect Information Games: Algorithms and Complexity0
Towards Interpretable Classification of Leukocytes based on Deep Learning0
How to ensure a safe control strategy? Towards a SRL for urban transit autonomous operation0
New Epochs in AI Supervision: Design and Implementation of an Autonomous Radiology AI Monitoring System0
Multi-intention Inverse Q-learning for Interpretable Behavior RepresentationCode0
To Transmit or Not to Transmit: Optimal Sensor Schedule for Remote State Estimation of Discrete-Event Systems0
Federated Learning Assisted Distributed Energy Optimization0
Learning Dynamic Selection and Pricing of Out-of-Home DeliveriesCode0
Enhancing mTBI Diagnosis with Residual Triplet Convolutional Neural Network Using 3D CT0
Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach0
Deep Neural Decision Forest: A Novel Approach for Predicting Recovery or Decease of Patients0
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character DesignCode2
Robust and Interpretable COVID-19 Diagnosis on Chest X-ray Images using Adversarial Training0
Labeling Neural Representations with Inverse RecognitionCode1
Show:102550
← PrevPage 84 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified