SOTAVerified

Decision Making

Papers

Showing 11101-11150 of 12311 papers

Title | Status | Hype
Safe and Adaptive Decision-Making for Optimization of Safety-Critical Systems: The ARTEO Algorithm | Code | 0
A Lyapunov-based Approach to Safe Reinforcement Learning | Code | 0
Active Learning for Decision-Making from Imbalanced Observational Data | Code | 0
ET5: A Novel End-to-end Framework for Conversational Machine Reading Comprehension | Code | 0
High-dimensional forecasting with known knowns and known unknowns | Code | 0
Measuring the Stability of Process Outcome Predictions in Online Settings | Code | 0
Rationalising data collection for supporting decision making in building energy systems using Value of Information analysis | Code | 0
ValueDCG: Measuring Comprehensive Human Value Understanding Ability of Language Models | Code | 0
Measuring Agreeableness Bias in Multimodal Models | Code | 0
Reinforcement Learning based Collective Entity Alignment with Adaptive Features | Code | 0
Exploring the Jungle of Bias: Political Bias Attribution in Language Models via Dependency Analysis | Code | 0
Calibrating the Rigged Lottery: Making All Tickets Reliable | Code | 0
Higher-order Neural Additive Models: An Interpretable Machine Learning Model with Feature Interactions | Code | 0
AutoScore-Imbalance: An interpretable machine learning tool for development of clinical scores with rare events data | Code | 0
Deep Reinforcement Learning for Chinese Zero pronoun Resolution | Code | 0
On the Impact of Feeding Cost Risk in Aquaculture Valuation and Decision Making | Code | 0
High-Fidelity Transfer of Functional Priors for Wide Bayesian Neural Networks by Learning Activations | Code | 0
Navigating the Synthetic Realm: Harnessing Diffusion-based Models for Laparoscopic Text-to-Image Generation | Code | 0
Autoregressive Bandits | Code | 0
A code-driven tutorial on encrypted control: From pioneering realizations to modern implementations | Code | 0
PatchCTG: Patch Cardiotocography Transformer for Antepartum Fetal Health Monitoring | Code | 0
"A Good Bot Always Knows Its Limitations": Assessing Autonomous System Decision-making Competencies through Factorized Machine Self-confidence | Code | 0
A perspective on multi-agent communication for information fusion | Code | 0
Calibrating Deep Convolutional Gaussian Processes | Code | 0
Latent Spaces Enable Transformer-Based Dose Prediction in Complex Radiotherapy Plans | Code | 0
MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Code | 0
Evaluating Deep Taylor Decomposition for Reliability Assessment in the Wild | Code | 0
N-BEATS neural network for mid-term electricity load forecasting | Code | 0
High-resolution agent-based modeling of COVID-19 spreading in a small town | Code | 0
Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome | Code | 0
On the (im)possibility of fairness | Code | 0
Auto-Platoon : Freight by example | Code | 0
ISLE: An Intelligent Streaming Framework for High-Throughput AI Inference in Medical Imaging | Code | 0
CAIS-DMA: A Decision-Making Assistant for Collaborative AI Systems | Code | 0
Agent-State Construction with Auxiliary Inputs | Code | 0
Near-Optimal Non-Parametric Sequential Tests and Confidence Sequences with Possibly Dependent Observations | Code | 0
Hindsight and Sequential Rationality of Correlated Play | Code | 0
Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Code | 0
Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market Simulations | Code | 0
Pragmatic Image Compression for Human-in-the-Loop Decision-Making | Code | 0
Hindsight Learning for MDPs with Exogenous Inputs | Code | 0
On the Expressiveness of Approximate Inference in Bayesian Neural Networks | Code | 0
Byzantine-Robust Distributed Online Learning: Taming Adversarial Participants in An Adversarial Environment | Code | 0
Evaluating Machine Learning Models against Clinical Protocols for Enhanced Interpretability and Continuity of Care | Code | 0
Evaluating model calibration in classification | Code | 0
PathoWAve: A Deep Learning-based Weight Averaging Method for Improving Domain Generalization in Histopathology Images | Code | 0
Active Learning for Argument Strength Estimation | Code | 0
Hire Me or Not? Examining Language Model's Behavior with Occupation Attributes | Code | 0
Evaluating Short-Term Forecasting of Multiple Time Series in IoT Environments | Code | 0
Autonomous Capability Assessment of Sequential Decision-Making Systems in Stochastic Settings (Extended Version) | Code | 0
Page 223 of 247

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | - | Unverified