SOTAVerified

Decision Making

Papers

Showing 26012650 of 12311 papers

TitleStatusHype
A Unified Framework for Input Feature Attribution Analysis0
Multimodal Deformable Image Registration for Long-COVID Analysis Based on Progressive Alignment and Multi-perspective Loss0
KnobTree: Intelligent Database Parameter Configuration via Explainable Reinforcement Learning0
CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM PipelinesCode0
Modeling of spatially embedded networks via regional spatial graph convolutional networksCode0
Advantage Alignment Algorithms0
Self-Attention in Transformer Networks Explains Monkeys' Gaze Pattern in Pac-Man Game0
ImageFlowNet: Forecasting Multiscale Image-Level Trajectories of Disease Progression with Irregularly-Sampled Longitudinal Medical ImagesCode1
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought0
Active Learning for Fair and Stable Online Allocations0
Tractable Equilibrium Computation in Markov Games through Risk Aversion0
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing0
E-ANT: A Large-Scale Dataset for Efficient Automatic GUI NavigaTion0
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers0
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency TradingCode2
REVEAL-IT: REinforcement learning with Visibility of Evolving Agent poLicy for InTerpretabilityCode0
IWISDM: Assessing instruction following in multimodal models at scaleCode0
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs0
Self-supervised Interpretable Concept-based Models for Text Classification0
Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond0
Research on fusing topological data analysis with convolutional neural network0
Analyzing Diversity in Healthcare LLM Research: A Scientometric Perspective0
Combining Combined Forecasts: a Network Approach0
Reinforcing Pre-trained Models Using Counterfactual Images0
Solarcast-ML: Per Node GraphCast Extension for Solar Energy Production0
FreqRISE: Explaining time series using frequency maskingCode0
Reasoning with trees: interpreting CNNs using hierarchiesCode0
Nicer Than Humans: How do Large Language Models Behave in the Prisoner's Dilemma?0
ARDuP: Active Region Video Diffusion for Universal Policies0
Utility Pole Fire Risk Inspection from 2D Street-Side Images0
SituationalLLM: Proactive language models with scene awareness for dynamic, contextual task guidanceCode0
Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation0
Statistical Uncertainty in Word Embeddings: GloVe-VCode1
UAV-based Intelligent Information Systems on Winter Road Safety for Autonomous Vehicles0
MiSuRe is all you need to explain your image segmentation0
Ask-before-Plan: Proactive Language Agents for Real-World PlanningCode1
PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision MakersCode2
Investigating the Role of Explainability and AI Literacy in User Compliance0
Hoping for the best while preparing for the worst in the face of uncertainty: a new type of incomplete preferences0
Efficient Sequential Decision Making with Large Language Models0
Grade Score: Quantifying LLM Performance in Option SelectionCode0
Computing in the Life Sciences: From Early Algorithms to Modern AICode0
Online Pareto-Optimal Decision-Making for Complex Tasks using Active Inference0
Bridging Design Gaps: A Parametric Data Completion Approach With Graph Guided Diffusion Models0
Model Adaptation for Time Constrained Embodied Control0
Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms0
Development of an Adaptive Multi-Domain Artificial Intelligence System Built using Machine Learning and Expert Systems Technologies0
CHG Shapley: Efficient Data Valuation and Selection towards Trustworthy Machine LearningCode0
Optimal Transport-Assisted Risk-Sensitive Q-Learning0
Calibrating Where It Matters: Constrained Temperature Scaling0
Show:102550
← PrevPage 53 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified