| A Sufficient Statistic for Influence in Structured Multiagent Environments | Jul 22, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| CoordiQ : Coordinated Q-learning for Electric Vehicle Charging Recommendation | Jan 28, 2021 | Decision MakingQ-Learning | —Unverified | 0 |
| CoRAL: Collaborative Retrieval-Augmented Large Language Models Improve Long-tail Recommendation | Mar 11, 2024 | Recommendation SystemsReinforcement Learning (RL) | —Unverified | 0 |
| Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces | Jul 8, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | Apr 20, 2021 | ClusteringDecision Making | —Unverified | 0 |
| Bandits with Unobserved Confounders: A Causal Approach | Dec 1, 2015 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Bandits in Matching Markets: Ideas and Proposals for Peer Lending | Oct 30, 2020 | Decision MakingFairness | —Unverified | 0 |
| A Multi-Agent Reinforcement Learning Approach for Cooperative Air-Ground-Human Crowdsensing in Emergency Rescue | May 11, 2025 | Decision Making Under UncertaintyMulti-agent Reinforcement Learning | —Unverified | 0 |
| Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games | Mar 8, 2021 | counterfactualDecision Making | —Unverified | 0 |
| Bandit Convex Optimization in Non-stationary Environments | Jul 29, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | Feb 6, 2021 | Decision MakingManagement | —Unverified | 0 |
| A Classification View on Meta Learning Bandits | Apr 6, 2025 | ClassificationMeta-Learning | —Unverified | 0 |
| Bandit based centralized matching in two-sided markets for peer to peer lending | May 6, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A modular framework for object-based saccadic decisions in dynamic scenes | Jun 10, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Adaptive Exploration in Linear Contextual Bandit | Oct 15, 2019 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| AVID: Adapting Video Diffusion Models to World Models | Oct 1, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Auxiliary Reward Generation with Transition Distance Representation Learning | Feb 12, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| A Mini Review on the utilization of Reinforcement Learning with OPC UA | May 24, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes | Jul 30, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| DIP-RL: Demonstration-Inferred Preference Learning in Minecraft | Jul 22, 2023 | Decision MakingMinecraft | —Unverified | 0 |
| Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | Dec 1, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Divide-and-Conquer Monte Carlo Tree Search | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems | May 2, 2025 | Decision MakingPrediction Intervals | —Unverified | 0 |
| AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization | Jun 5, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Exploring Offline Policy Evaluation for the Continuous-Armed Bandit Problem | Aug 21, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Autonomous Tree-search Ability of Large Language Models | Oct 14, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Autonomous Charging of Electric Vehicle Fleets to Enhance Renewable Generation Dispatchability | Dec 22, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A method for the online construction of the set of states of a Markov Decision Process using Answer Set Programming | Jun 5, 2017 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Design of intentional backdoors in sequential models | Feb 26, 2019 | Decision MakingReinforcement Learning | —Unverified | 0 |
| A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning | Aug 7, 2020 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts | Jan 1, 2025 | ClusteringOnline Clustering | —Unverified | 0 |
| Automating Predictive Modeling Process using Reinforcement Learning | Mar 2, 2019 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Automatic Goal Generation using Dynamical Distance Learning | Mar 9, 2021 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Demystify Painting with RL | Dec 14, 2020 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning | Jul 25, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Automatic Goal Generation using Dynamical Distance Learning | Nov 7, 2021 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| All AI Models are Wrong, but Some are Optimal | Jan 10, 2025 | AllDecision Making | —Unverified | 0 |
| Delay and Cooperation in Nonstochastic Linear Bandits | Dec 1, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RLDecision Making | —Unverified | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision MakingManagement | —Unverified | 0 |
| Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching | Jan 24, 2019 | Decision MakingEfficient Exploration | —Unverified | 0 |
| Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | May 23, 2025 | Motion PlanningSequential Decision Making | —Unverified | 0 |
| adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems | Mar 7, 2023 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Delayed Feedback in Generalised Linear Bandits Revisited | Jul 21, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction | Mar 3, 2017 | Decision MakingDependency Parsing | —Unverified | 0 |
| Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games | Apr 24, 2016 | Atari GamesDecision Making | —Unverified | 0 |
| Deep Q-Network-based Adaptive Alert Threshold Selection Policy for Payment Fraud Systems in Retail Banking | Oct 21, 2020 | Decision MakingFraud Detection | —Unverified | 0 |
| Automated Cyber Defence: A Review | Mar 8, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Automated Reinforcement Learning: An Overview | Jan 13, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents | Mar 13, 2024 | Decision MakingIn-Context Learning | —Unverified | 0 |