SOTAVerified

Decision Making

Papers

Showing 18511900 of 12311 papers

TitleStatusHype
XAI-based Feature Selection for Improved Network Intrusion Detection SystemsCode0
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes0
Investigating Implicit Bias in Large Language Models: A Large-Scale Study of Over 50 LLMs0
ContextWIN: Whittle Index Based Mixture-of-Experts Neural Model For Restless Bandits Via Deep RL0
Learning from the past: predicting critical transitions with machine learning trained on surrogates of historical dataCode0
Evaluating Gender Bias of LLMs in Making Morality Judgements0
Adaptive Reasoning and Acting in Medical Language Agents0
WormKAN: Are KAN Effective for Identifying and Tracking Concept Drift in Time Series?0
Interpretable Video based Stress Detection with Self-Refine Chain-of-thought Reasoning0
DAWN: Designing Distributed Agents in a Worldwide Network0
A Comparative Analysis on Ethical Benchmarking in Large Language Models0
Optimized Biomedical Question-Answering Services with LLM and Multi-BERT Integration0
Sui Generis: Large Language Models for Authorship Attribution and Verification in Latin0
Ranking over Regression for Bayesian Optimization and Molecule SelectionCode0
Causal machine learning for predicting treatment outcomes0
JurEE not Judges: safeguarding llm interactions with small, specialised Encoder Ensembles0
DiffPO: A causal diffusion model for learning distributions of potential outcomesCode1
Hybrid LLM-DDQN based Joint Optimization of V2I Communication and Autonomous Driving0
Learning Representations of Instruments for Partial Identification of Treatment EffectsCode0
Variance reduction combining pre-experiment and in-experiment data0
Integrating Expert Judgment and Algorithmic Decision Making: An Indistinguishability FrameworkCode0
Transferable Belief Model on Quantum Circuits0
TRIAGE: Ethical Benchmarking of AI Models Through Mass Casualty SimulationsCode0
Large Legislative Models: Towards Efficient AI Policymaking in Economic SimulationsCode0
Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare0
Decision-Aware Predictive Model Selection for Workforce Allocation0
Offline Hierarchical Reinforcement Learning via Inverse Optimization0
Explainability of Deep Neural Networks for Brain Tumor DetectionCode0
Audio Explanation Synthesis with Generative Foundation ModelsCode0
Gaussian Process Thompson Sampling via Rootfinding0
Mars: Situated Inductive Reasoning in an Open-World Environment0
A Generative AI Technique for Synthesizing a Digital Twin for U.S. Residential Solar Adoption and Generation0
Generalizable Indoor Human Activity Recognition Method Based on Micro-Doppler Corner Point Cloud and Dynamic Graph Learning0
Efficient Reinforcement Learning with Large Language Model Priors0
DisasterQA: A Benchmark for Assessing the performance of LLMs in Disaster Response0
Generating Origin-Destination Matrices in Neural Spatial Interaction ModelsCode0
Crafting desirable climate trajectories with RL explored socio-environmental simulationsCode0
The Moral Turing Test: Evaluating Human-LLM Alignment in Moral Decision-Making0
β-calibration of Language Model Confidence Scores for Generative QA0
Optimizing Estimators of Squared Calibration Errors in Classification0
Detecting Bias and Enhancing Diagnostic Accuracy in Large Language Models for Healthcare0
Modeling chaotic Lorenz ODE System using Scientific Machine Learning0
Rejecting Hallucinated State Targets during PlanningCode1
Reproducing and Extending Experiments in Behavioral Strategy with Large Language Models0
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision MakingCode3
On the Modeling Capabilities of Large Language Models for Sequential Decision Making0
Towards an Operational Responsible AI Framework for Learning Analytics in Higher Education0
Cooperative and Asynchronous Transformer-based Mission Planning for Heterogeneous Teams of Mobile RobotsCode0
Navigating Inflation in Ghana: How Can Machine Learning Enhance Economic Stability and Growth Strategies0
HumVI: A Multilingual Dataset for Detecting Violent Incidents Impacting Humanitarian AidCode0
Show:102550
← PrevPage 38 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified