SOTAVerified

Decision Making

Papers

Showing 18011825 of 12311 papers

TitleStatusHype
Assistive AI for Augmenting Human Decision-making0
Mathematical modelling to inform outbreak response vaccination0
Recurrent Neural Goodness-of-Fit Test for Time SeriesCode0
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement LearningCode1
FinQAPT: Empowering Financial Decisions with End-to-End LLM-driven Question Answering Pipeline0
Identifying High Consideration E-Commerce Search Queries0
The Subtlety of Optimal Paternalism in a Population with Bounded Rationality0
Interpreting Inflammation Prediction Model via Tag-based Cohort Explanation0
FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial ModelingCode0
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web NavigationCode1
Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval0
Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation0
Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning0
RGB to Hyperspectral: Spectral Reconstruction for Enhanced Surgical Imaging0
Generative Conformal Prediction with Vectorized Non-Conformity Scores0
Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis SimulationsCode0
MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task AutomationCode1
First-Person Fairness in Chatbots0
AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative ReasoningCode1
Local Off-Grid Weather Forecasting with Multi-Modal Earth Observation DataCode2
Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse RL0
Double-Bayesian Learning0
Using Protected Attributes to Consider Fairness in Multi-Agent Systems0
STRUX: An LLM for Decision-Making with Structured Explanations0
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision MakingCode0
Show:102550
← PrevPage 73 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified