SOTAVerified

Decision Making

Papers

Showing 45014550 of 12311 papers

TitleStatusHype
Contextualized Policy Recovery: Modeling and Interpreting Medical Decisions with Adaptive Imitation Learning0
Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples0
Human-Centered Evaluation of XAI Methods0
QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-CheckingCode1
Learning a Reward Function for User-Preferred Appliance SchedulingCode0
Democratizing LLMs: An Exploration of Cost-Performance Trade-offs in Self-Refined Open-Source Models0
Interaction-aware Traffic Prediction and Scenario-based Model Predictive Control for Autonomous Vehicles on Highways0
SurroCBM: Concept Bottleneck Surrogate Models for Generative Post-hoc Explanation0
Decentralization of Energy Systems with Blockchain: Bridging Top-down and Bottom-up Management of the Electricity Grid0
Explainable Image Similarity: Integrating Siamese Networks and Grad-CAMCode1
Linear Latent World Models in Simple Transformers: A Case Study on Othello-GPTCode1
NeuroInspect: Interpretable Neuron-based Debugging Framework through Class-conditional VisualizationsCode0
Imitation Learning from Purified DemonstrationsCode0
Prospective Side Information for Latent MDPs0
Adversarial Masked Image Inpainting for Robust Detection of Mpox and Non-Mpox0
Dobby: A Conversational Service Robot Driven by GPT-40
What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language ModelsCode1
Evaluating Explanation Methods for Vision-and-Language Navigation0
Ensemble Active Learning by Contextual Bandits for AI Incubation in Manufacturing0
Are Large Language Models Geospatially Knowledgeable?Code0
Pain Forecasting using Self-supervised Learning and Patient Phenotyping: An attempt to prevent Opioid Addiction0
On Prediction-Modelers and Decision-Makers: Why Fairness Requires More Than a Fair Prediction Model0
Quantifying Uncertainty in Deep Learning Classification with Noise in Discrete Inputs for Risk-Based Decision Making0
Fair Classifiers that Abstain without Harm0
Distributional Soft Actor-Critic with Three RefinementsCode2
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models0
Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor DiscussionsCode0
Analysis of Rainfall Variability and Water Extent of Selected Hydropower Reservoir Using Google Earth Engine (GEE): A Case Study from Two Tropical Countries, Sri Lanka and Vietnam0
A Review of the Ethics of Artificial Intelligence and its Applications in the United States0
Ethics of Artificial Intelligence and Robotics in the Architecture, Engineering, and Construction Industry0
Modeling motor control in continuous-time Active Inference: a survey0
A new economic and financial theory of money0
Global Convergence of Policy Gradient Methods in Reinforcement Learning, Games and Control0
Human-in-the-loop: The future of Machine Learning in Automated Electron Microscopy0
AvalonBench: Evaluating LLMs Playing the Game of AvalonCode1
Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language ModelsCode1
ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data0
Digital Twin Assisted Deep Reinforcement Learning for Online Admission Control in Sliced Network0
Investigating the Influence of Legal Case Retrieval Systems on Users' Decision Process0
Critique Ability of Large Language Models0
Optimal Sequential Decision-Making in Geosteering: A Reinforcement Learning Approach0
Question-focused Summarization by Decomposing Articles into Facts and Opinions and Retrieving Entities0
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced DatasetsCode1
Amortized Network Intervention to Steer the Excitatory Point Processes0
Graph learning in robotics: a survey0
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language ModelsCode2
Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language ModelsCode6
Deep Learning for Two-Stage Robust Integer OptimizationCode1
Policy-Gradient Training of Language Models for Ranking0
Consistency Regularization Improves Placenta Segmentation in Fetal EPI MRI Time SeriesCode0
Show:102550
← PrevPage 91 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified