SOTAVerified

Decision Making

Papers

Showing 42514300 of 12311 papers

TitleStatusHype
GPT-4V(ision) Unsuitable for Clinical Care and Education: A Clinician-Evaluated Assessment0
Introducing an Improved Information-Theoretic Measure of Predictive Uncertainty0
Uncertainty Quantification in Neural-Network Based Pain Intensity Estimation0
Extrinsically-Focused Evaluation of Omissions in Medical Summarization0
Towards a Transportable Causal Network Model Based on Observational Healthcare Data0
Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization RegimeCode1
Optimising Human-AI Collaboration by Learning Convincing Explanations0
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question AnsweringCode1
Decision-making under risk: when is utility maximization equivalent to risk minimization?0
Real-Time Machine-Learning-Based Optimization Using Input Convex Long Short-Term Memory NetworkCode1
Enabling High-Level Machine Reasoning with Cognitive Neuro-Symbolic Systems0
Multi-agent Attacks for Black-box Social Recommendations0
Large Language Models for Robotics: A Survey0
An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making0
The Multi-BMBY Mechanism: Proportionality-Preserving and Strategyproof Ownership Restructuring in Private Companies0
Explainability of Vision Transformers: A Comprehensive Review and New Perspectives0
An advantage based policy transfer algorithm for reinforcement learning with measures of transferability0
TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System0
ChatGPT Exhibits Gender and Racial Biases in Acute Coronary Syndrome Management0
Business Policy Experiments using Fractional Factorial Designs: Consumer Retention on DoorDash0
MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable UncertaintyCode1
Forte: An Interactive Visual Analytic Tool for Trust-Augmented Net Load Forecasting0
Language Models can be Logical Solvers0
Polar-Net: A Clinical-Friendly Model for Alzheimer's Disease Detection in OCTA Images0
GRAM: An Interpretable Approach for Graph Anomaly Detection using Gradient Attention Maps0
Sum-max Submodular Bandits0
Hallucination-minimized Data-to-answer Framework for Financial Decision-makers0
Labor Space: A Unifying Representation of the Labor Market via Large Language Models0
Fair Wasserstein Coresets0
Exploring and Analyzing Wildland Fire Data Via Machine Learning Techniques0
Generative Explanations for Graph Neural Network: Methods and Evaluations0
Optimal simulation-based Bayesian decisions0
Long-Horizon Dialogue Understanding for Role Identification in the Game of Avalon with Large Language Models0
Green Resilience of Cyber-Physical SystemsCode0
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous DrivingCode2
ADaPT: As-Needed Decomposition and Planning with Language ModelsCode1
Toward Rapid, Optimal, and Feasible Power Dispatch through Generalized Neural Mapping0
Vital Sign Forecasting for Sepsis Patients in ICUs0
Likelihood Ratio Confidence Sets for Sequential Decision Making0
CAIS-DMA: A Decision-Making Assistant for Collaborative AI SystemsCode0
Anonymizing medical case-based explanations through disentanglement0
AI for All: Operationalising Diversity and Inclusion Requirements for AI Systems0
A Biologically-Inspired Computational Model of Time Perception0
Everything of Thoughts: Defying the Law of Penrose Triangle for Thought GenerationCode1
Adaptive Stochastic Nonlinear Model Predictive Control with Look-ahead Deep Reinforcement Learning for Autonomous Vehicle Motion Control0
Operational risk quantification of power grids using graph neural network surrogates of the DC OPF0
Evaluating Large Language Models in Ophthalmology0
An Explainable Framework for Machine learning-Based Reactive Power Optimization of Distribution Network0
Cal-DETR: Calibrated Detection TransformerCode1
Evolutionary City: Towards a Flexible, Agile and Symbiotic System0
Show:102550
← PrevPage 86 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified