SOTAVerified

Decision Making

Papers

Showing 18011850 of 12311 papers

TitleStatusHype
Assistive AI for Augmenting Human Decision-making0
Mathematical modelling to inform outbreak response vaccination0
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement LearningCode1
Recurrent Neural Goodness-of-Fit Test for Time SeriesCode0
The Subtlety of Optimal Paternalism in a Population with Bounded Rationality0
FinQAPT: Empowering Financial Decisions with End-to-End LLM-driven Question Answering Pipeline0
Interpreting Inflammation Prediction Model via Tag-based Cohort Explanation0
Identifying High Consideration E-Commerce Search Queries0
FDF: Flexible Decoupled Framework for Time Series Forecasting with Conditional Denoising and Polynomial ModelingCode0
MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task AutomationCode1
Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval0
Measuring Free-Form Decision-Making Inconsistency of Language Models in Military Crisis SimulationsCode0
Breaking Chains: Unraveling the Links in Multi-Hop Knowledge Unlearning0
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web NavigationCode1
RGB to Hyperspectral: Spectral Reconstruction for Enhanced Surgical Imaging0
Pseudo Dataset Generation for Out-of-Domain Multi-Camera View Recommendation0
Generative Conformal Prediction with Vectorized Non-Conformity Scores0
First-Person Fairness in Chatbots0
AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative ReasoningCode1
Local Off-Grid Weather Forecasting with Multi-Modal Earth Observation DataCode2
Double-Bayesian Learning0
Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse RL0
Using Protected Attributes to Consider Fairness in Multi-Agent Systems0
Conformity in Large Language Models0
TPFL: A Trustworthy Personalized Federated Learning Framework via Subjective Logic0
Consistency Calibration: Improving Uncertainty Calibration via Consistency among Perturbed Neighbors0
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling0
Self-DenseMobileNet: A Robust Framework for Lung Nodule Classification using Self-ONN and Stacking-based Meta-Classifier0
STRUX: An LLM for Decision-Making with Structured Explanations0
Counterfactual Effect Decomposition in Multi-Agent Sequential Decision MakingCode0
Trajectory Prediction for Autonomous Driving using Agent-Interaction Graph Embedding0
System-Level Analysis of Module Uncertainty Quantification in the Autonomy Pipeline0
A Prompt-Guided Spatio-Temporal Transformer Model for National-Wide Nuclear Radiation Forecasting0
Technical Report of 1:10 Scale Autonomous Vehicle Robot0
Explainable AI Methods for Multi-Omics Analysis: A Survey0
DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting0
Black-box Uncertainty Quantification Method for LLM-as-a-Judge0
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation0
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task0
Process Reward Model with Q-Value RankingsCode2
Data-Driven Uncertainty-Aware Forecasting of Sea Ice Conditions in the Gulf of Ob Based on Satellite Radar Imagery0
Gender Bias of LLM in Economics: An Existentialism Perspective0
Study on the Helpfulness of Explainable Artificial IntelligenceCode0
Skill Learning Using Process Mining for Large Language Model Plan Generation0
Persistent Topological Features in Large Language ModelsCode1
Gender Bias in Decision-Making with Large Language Models: A Study of Relationship ConflictsCode0
ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera0
Stable Hadamard Memory: Revitalizing Memory-Augmented Agents for Reinforcement LearningCode1
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes0
XAI-based Feature Selection for Improved Network Intrusion Detection SystemsCode0
Show:102550
← PrevPage 37 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified