SOTAVerified

Decision Making

Papers

Showing 3551-3600 of 12311 papers

| Title | Status | Hype |
|---|---|---|
| Challenging the Black Box: A Comprehensive Evaluation of Attribution Maps of CNN Applications in Agriculture and Forestry | | 0 |
| A Note on Bias to Complete | | 0 |
| Dynamic planning in hierarchical active inference | Code | 1 |
| Be Persistent: Towards a Unified Solution for Mitigating Shortcuts in Deep Learning | | 0 |
| Multi-Generative Agent Collective Decision-Making in Urban Planning: A Case Study for Kendall Square Renovation | | 0 |
| Evaluating the Stability of Deep Learning Latent Feature Spaces | | 0 |
| BiasBuster: a Neural Approach for Accurate Estimation of Population Statistics using Biased Location Data | | 0 |
| Probability Tools for Sequential Random Projection | | 0 |
| Operational Collective Intelligence of Humans and Machines | | 0 |
| FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models | Code | 0 |
| A Theory of LLM Sampling: Part Descriptive and Part Prescriptive | | 0 |
| Are you Struggling? Dataset and Baselines for Struggle Determination in Assembly Videos | Code | 0 |
| Optimizing Warfarin Dosing Using Contextual Bandit: An Offline Policy Learning and Evaluation Method | | 0 |
| Building Trees for Probabilistic Prediction via Scoring Rules | | 0 |
| Enhancing ESG Impact Type Identification through Early Fusion and Multilingual Models | | 0 |
| Explaining generative diffusion models via visual analysis for interpretable decision-making process | Code | 1 |
| Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks | | 0 |
| RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model | Code | 2 |
| Rethinking Human-like Translation Strategy: Integrating Drift-Diffusion Model with Large Language Models for Machine Translation | | 0 |
| PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control | Code | 1 |
| Network Formation and Dynamics Among Multi-LLMs | Code | 0 |
| Cloud Kitchen: Using Planning-based Composite AI to Optimize Food Delivery Processes | | 0 |
| A novel integrated industrial approach with cobots in the age of industry 4.0 through conversational interaction and computer vision | | 0 |
| RAGIC: Risk-Aware Generative Adversarial Model for Stock Interval Construction | | 0 |
| Generative AI and Process Systems Engineering: The Next Frontier | | 0 |
| Thompson Sampling in Partially Observable Contextual Bandits | | 0 |
| Practitioners' Challenges and Perceptions of CI Build Failure Predictions at Atlassian | | 0 |
| Less is more: Ensemble Learning for Retinal Disease Recognition Under Limited Resources | | 0 |
| Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent | Code | 2 |
| Second Order Methods for Bandit Optimization and Control | | 0 |
| Reasoning over Uncertain Text by Generative Large Language Models | Code | 0 |
| STEER: Assessing the Economic Rationality of Large Language Models | | 0 |
| Inference for an Algorithmic Fairness-Accuracy Frontier | | 0 |
| Dataset Clustering for Improved Offline Policy Learning | Code | 0 |
| Cross-Temporal Forecast Reconciliation at Digital Platforms with Machine Learning | | 0 |
| LogicPrpBank: A Corpus for Logical Implication and Equivalence | Code | 0 |
| Large Language Model-Based Interpretable Machine Learning Control in Building Energy Systems | | 0 |
| Learning-enabled Flexible Job-shop Scheduling for Scalable Smart Manufacturing | | 0 |
| Computational Complexity of Preferred Subset Repairs on Data-Graphs | | 0 |
| Synergistic eigenanalysis of covariance and Hessian matrices for enhanced binary classification | | 0 |
| An Adaptive System Architecture for Multimodal Intelligent Transportation Systems | | 0 |
| Exploration by Optimization with Hybrid Regularizers: Logarithmic Regret with Adversarial Robustness in Partial Monitoring | | 0 |
| Intelligent Agricultural Management Considering N_2O Emission and Climate Variability with Uncertainties | | 0 |
| Average-Case Analysis of Iterative Voting | | 0 |
| Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Code | 0 |
| Differentially Private Distributed Inference | Code | 0 |
| Fairness Auditing with Multi-Agent Collaboration | Code | 0 |
| Inherent Diverse Redundant Safety Mechanisms for AI-based Software Elements in Automotive Applications | | 0 |
| LLMs and the Human Condition | | 0 |
| CMA-R: Causal Mediation Analysis for Explaining Rumour Detection | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified |