| A Sufficient Statistic for Influence in Structured Multiagent Environments | Jul 22, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Mar 11, 2024 | Recommendation SystemsReinforcement Learning (RL) | —Unverified | 0 |
| Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces | Jul 8, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing | May 27, 2025 | Sequential Decision Making | —Unverified | 0 |
| A Computational Framework for Motor Skill Acquisition | Jan 3, 2019 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking | Oct 21, 2020 | Decision MakingFraud Detection | —Unverified | 0 |
| Bandits with Unobserved Confounders: A Causal Approach | Dec 1, 2015 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Bandits in Matching Markets: Ideas and Proposals for Peer Lending | Oct 30, 2020 | Decision MakingFairness | —Unverified | 0 |
| A Multi-Agent Reinforcement Learning Approach for Cooperative Air-Ground-Human Crowdsensing in Emergency Rescue | May 11, 2025 | Decision Making Under UncertaintyMulti-agent Reinforcement Learning | —Unverified | 0 |
| Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games | Mar 8, 2021 | counterfactualDecision Making | —Unverified | 0 |
| Bandit Convex Optimization in Non-stationary Environments | Jul 29, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | Feb 6, 2021 | Decision MakingManagement | —Unverified | 0 |
| A Classification View on Meta Learning Bandits | Apr 6, 2025 | ClassificationMeta-Learning | —Unverified | 0 |
| Bandit based centralized matching in two-sided markets for peer to peer lending | May 6, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A modular framework for object-based saccadic decisions in dynamic scenes | Jun 10, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Adaptive Exploration in Linear Contextual Bandit | Oct 15, 2019 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| AVID: Adapting Video Diffusion Models to World Models | Oct 1, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Auxiliary Reward Generation with Transition Distance Representation Learning | Feb 12, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| A Mini Review on the utilization of Reinforcement Learning with OPC UA | May 24, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes | Jul 30, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision MakingManagement | —Unverified | 0 |
| Deep Reinforcement Learning for Adaptive Mesh Refinement | Sep 25, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Robust Goal-Based Wealth Management | Jul 25, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems | May 2, 2025 | Decision MakingPrediction Intervals | —Unverified | 0 |