SOTAVerified

Decision Making

Papers

Showing 1001–1050 of 12311 papers

| Title | Status | Hype |
|---|---|---|
| Is LLM an Overconfident Judge? Unveiling the Capabilities of LLMs in Detecting Offensive Language with Annotation Disagreement | Code | 0 |
| Beyond Batch Learning: Global Awareness Enhanced Domain Adaptation | | 0 |
| Motion Forecasting for Autonomous Vehicles: A Survey | | 0 |
| Words or Numbers? How Framing Uncertainties Affects Risk Assessment and Decision-Making | | 0 |
| Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation | | 0 |
| Dynamic Pricing with Adversarially-Censored Demands | | 0 |
| Contextual Thompson Sampling via Generation of Missing Data | | 0 |
| Koopman-Equivariant Gaussian Processes | | 0 |
| Intelligent Offloading in Vehicular Edge Computing: A Comprehensive Review of Deep Reinforcement Learning Approaches and Architectures | | 0 |
| Decision Making in Hybrid Environments: A Model Aggregation Approach | | 0 |
| MTPChat: A Multimodal Time-Aware Persona Dataset for Conversational Agents | | 0 |
| Digital Twin Buildings: 3D Modeling, GIS Integration, and Visual Descriptions Using Gaussian Splatting, ChatGPT/Deepseek, and Google Maps Platform | | 0 |
| Projection-based Lyapunov method for fully heterogeneous weakly-coupled MDPs | | 0 |
| Polynomial Regret Concentration of UCB for Non-Deterministic State Transitions | | 0 |
| A Survey on Explainable Deep Reinforcement Learning | | 0 |
| Closing the Responsibility Gap in AI-based Network Management: An Intelligent Audit System Approach | | 0 |
| Managing Geological Uncertainty in Critical Mineral Supply Chains: A POMDP Approach with Application to U.S. Lithium Resources | | 0 |
| Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models | Code | 0 |
| The Odyssey of the Fittest: Can Agents Survive and Still Be Good? | Code | 0 |
| Pareto-Optimality, Smoothness, and Stochasticity in Learning-Augmented One-Max-Search | | 0 |
| Agentic AI Systems Applied to tasks in Financial Services: Modeling and model risk management crews | | 0 |
| Discounting under inequality and lobbyists disagreement | | 0 |
| Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research | Code | 0 |
| Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization | | 0 |
| Conformal Prediction for Electricity Price Forecasting in the Day-Ahead and Real-Time Balancing Market | | 0 |
| Bridging Voting and Deliberation with Algorithms: Field Insights from vTaiwan and Kultur Komitee | | 0 |
| PRISM: A Robust Framework for Skill-based Meta-Reinforcement Learning with Noisy Demonstrations | | 0 |
| Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation | | 0 |
| Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making | | 0 |
| Fairness Aware Reinforcement Learning via Proximal Policy Optimization | | 0 |
| Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study | | 0 |
| CAPE: Covariate-Adjusted Pre-Training for Epidemic Time Series Forecasting | | 0 |
| Aero-LLM: A Distributed Framework for Secure UAV Communication and Intelligent Decision-Making | | 0 |
| ScholaWrite: A Dataset of End-to-End Scholarly Writing Process | | 0 |
| Position: Emergent Machina Sapiens Urge Rethinking Multi-Agent Paradigms | | 0 |
| Early Stopping in Contextual Bandits and Inferences | | 0 |
| How Inclusively do LMs Perceive Social and Moral Norms? | Code | 0 |
| Theoretical Frameworks for Integrating Sustainability Factors into Institutional Investment Decision-Making | | 0 |
| Runway capacity expansion planning for public airports under demand uncertainty | | 0 |
| Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents | | 0 |
| Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models | | 0 |
| Online Clustering of Dueling Bandits | | 0 |
| On the Guidance of Flow Matching | Code | 2 |
| Anytime Incremental ρPOMDP Planning in Continuous Spaces | | 0 |
| CH-MARL: Constrained Hierarchical Multiagent Reinforcement Learning for Sustainable Maritime Logistics | | 0 |
| VLA-Cache: Towards Efficient Vision-Language-Action Model via Adaptive Token Caching in Robotic Manipulation | | 0 |
| From Divergence to Consensus: Evaluating the Role of Large Language Models in Facilitating Agreement through Adaptive Strategies | | 0 |
| Position: Empowering Time Series Reasoning with Multimodal LLMs | | 0 |
| Can Domain Experts Rely on AI Appropriately? A Case Study on AI-Assisted Prostate Cancer MRI Diagnosis | | 0 |
| Diffusion Model for Multiple Antenna Communications | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified |