SOTAVerified

Decision Making

Papers

Showing 1120111250 of 12311 papers

TitleStatusHype
CoMIX: A Multi-agent Reinforcement Learning Training Architecture for Efficient Decentralized Coordination and Independent Decision-MakingCode0
Mesh-Informed Reduced Order Models for Aneurysm Rupture Risk PredictionCode0
Probabilistic Logic Programming with Beta-Distributed Random VariablesCode0
EVKG: An Interlinked and Interoperable Electric Vehicle Knowledge Graph for Smart Transportation SystemCode0
How Inclusively do LMs Perceive Social and Moral Norms?Code0
Network Formation and Dynamics Among Multi-LLMsCode0
Self-supervised Multi-modal Training from Uncurated Image and Reports Enables Zero-shot Oversight Artificial Intelligence in RadiologyCode0
How Personality Traits Influence Negotiation Outcomes? A Simulation based on Large Language ModelsCode0
Evolutionary Multi-Armed Bandits with Genetic Thompson SamplingCode0
An Analysis of Robustness of Non-Lipschitz NetworksCode0
Action and Perception as Divergence MinimizationCode0
On the Privacy Risks of Algorithmic FairnessCode0
How Robust is your Fair Model? Exploring the Robustness of Diverse Fairness StrategiesCode0
Modeling the effects of environmental and perceptual uncertainty using deterministic reinforcement learning dynamics with partial observabilityCode0
Metacontrol for Adaptive Imagination-Based OptimizationCode0
How Should We Represent History in Interpretable Models of Clinical Policies?Code0
Learning Classifier Systems for Self-Explaining Socio-Technical-SystemsCode0
Automated decision-making for dynamic task assignment at scaleCode0
QAGCN: Answering Multi-Relation Questions via Single-Step Implicit Reasoning over Knowledge GraphsCode0
Achieving Fairness in DareFightingICE Agents Evaluation Through a Delay MechanismCode0
Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language ModelsCode0
Deep Neuroevolution of Recurrent and Discrete World ModelsCode0
How to Control Hydrodynamic Force on Fluidic Pinball via Deep Reinforcement LearningCode0
A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal ControlCode0
Improving the forecast accuracy of wind power by leveraging multiple hierarchical structureCode0
An Unsupervised Video Game Playstyle Metric via State DiscretizationCode0
Learning Coordination Policies over Heterogeneous Graphs for Human-Robot Teams via Recurrent Neural Schedule PropagationCode0
Learning credit assignmentCode0
An Underexplored Dilemma between Confidence and Calibration in Quantized Neural NetworksCode0
Combining unsupervised and supervised learning for predicting the final stroke lesionCode0
Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPsCode0
Learning Decision Policies with Instrumental Variables through Double Machine LearningCode0
AgentSimulator: An Agent-based Approach for Data-driven Business Process SimulationCode0
How Well Do Feature-Additive Explainers Explain Feature-Additive Predictors?Code0
Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational AutoencodersCode0
Learning Discrete State Abstractions With Deep Variational InferenceCode0
Alpha Elimination: Using Deep Reinforcement Learning to Reduce Fill-In during Sparse Matrix DecompositionCode0
Neural Bayesian Network UnderstudyCode0
Excitation Dropout: Encouraging Plasticity in Deep Neural NetworksCode0
ExClaim: Explainable Neural Claim Verification Using RationalizationCode0
Meta-Models: An Architecture for Decoding LLM Behaviors Through Interpreted Embeddings and Natural LanguageCode0
Deep Learning for Predicting Dynamic Uncertain Opinions in Network DataCode0
Learning Dynamic Cognitive Map with Autonomous NavigationCode0
Learning Dynamic Graphs from All Contextual Information for Accurate Point-of-Interest Visit ForecastingCode0
Bridging the Gap: Protocol Towards Fair and Consistent Affect AnalysisCode0
Learning Dynamic Selection and Pricing of Out-of-Home DeliveriesCode0
Deep Learning for Patient-Specific Kidney Graft Survival AnalysisCode0
On the Robustness of Adversarial Training Against Uncertainty AttacksCode0
Metaphor Detection with Cross-Lingual Model TransferCode0
Human-Algorithm Collaborative Bayesian Optimization for Engineering SystemsCode0
Show:102550
← PrevPage 225 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified