SOTAVerified

Decision Making

Papers

Showing 35013550 of 12311 papers

TitleStatusHype
Towards a Science Exocortex0
Imperfect-Recall Games: Equilibrium Concepts and Their Complexity0
Hardware-Aware Neural Dropout Search for Reliable Uncertainty Prediction on FPGACode0
CAV-AHDV-CAV: Mitigating Traffic Oscillations for CAVs through a Novel Car-Following Structure and Reinforcement Learning0
Accelerating Matrix Diagonalization through Decision Transformers with Epsilon-Greedy Optimization0
Learning Abstract World Model for Value-preserving Planning with Options0
Privacy Implications of Explainable AI in Data-Driven Systems0
Adaptive Digital Twin and Communication-Efficient Federated Learning Network Slicing for 5G-enabled Internet of Things0
CaT-BENCH: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans0
A Unified Framework for Input Feature Attribution Analysis0
KnobTree: Intelligent Database Parameter Configuration via Explainable Reinforcement Learning0
PathoWAve: A Deep Learning-based Weight Averaging Method for Improving Domain Generalization in Histopathology ImagesCode0
Multimodal Deformable Image Registration for Long-COVID Analysis Based on Progressive Alignment and Multi-perspective Loss0
Catastrophic-risk-aware reinforcement learning with extreme-value-theory-based policy gradientsCode0
Urban-Focused Multi-Task Offline Reinforcement Learning with Contrastive Data Sharing0
REVEAL-IT: REinforcement learning with Visibility of Evolving Agent poLicy for InTerpretabilityCode0
Advantage Alignment Algorithms0
E-ANT: A Large-Scale Dataset for Efficient Automatic GUI NavigaTion0
Self-Attention in Transformer Networks Explains Monkeys' Gaze Pattern in Pac-Man Game0
IWISDM: Assessing instruction following in multimodal models at scaleCode0
Tractable Equilibrium Computation in Markov Games through Risk Aversion0
MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs0
Self-supervised Interpretable Concept-based Models for Text Classification0
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought0
Modeling of spatially embedded networks via regional spatial graph convolutional networksCode0
CEBench: A Benchmarking Toolkit for the Cost-Effectiveness of LLM PipelinesCode0
Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers0
Active Learning for Fair and Stable Online Allocations0
SituationalLLM: Proactive language models with scene awareness for dynamic, contextual task guidanceCode0
Research on fusing topological data analysis with convolutional neural network0
Analyzing Diversity in Healthcare LLM Research: A Scientometric Perspective0
FreqRISE: Explaining time series using frequency maskingCode0
Nicer Than Humans: How do Large Language Models Behave in the Prisoner's Dilemma?0
Reinforcing Pre-trained Models Using Counterfactual Images0
Combining Combined Forecasts: a Network Approach0
ARDuP: Active Region Video Diffusion for Universal Policies0
Learned Graph Rewriting with Equality Saturation: A New Paradigm in Relational Query Rewrite and Beyond0
Reasoning with trees: interpreting CNNs using hierarchiesCode0
Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation0
Solarcast-ML: Per Node GraphCast Extension for Solar Energy Production0
Utility Pole Fire Risk Inspection from 2D Street-Side Images0
UAV-based Intelligent Information Systems on Winter Road Safety for Autonomous Vehicles0
Investigating the Role of Explainability and AI Literacy in User Compliance0
MiSuRe is all you need to explain your image segmentation0
Hoping for the best while preparing for the worst in the face of uncertainty: a new type of incomplete preferences0
Optimal Transport-Assisted Risk-Sensitive Q-Learning0
Grade Score: Quantifying LLM Performance in Option SelectionCode0
Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms0
Computing in the Life Sciences: From Early Algorithms to Modern AICode0
Efficient Sequential Decision Making with Large Language Models0
Show:102550
← PrevPage 71 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified