| Deep Reinforcement Learning for Adaptive Mesh Refinement | Sep 25, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons | May 23, 2025 | Motion Planning, Sequential Decision Making | —Unverified | 0 |
| adaPARL: Adaptive Privacy-Aware Reinforcement Learning for Sequential-Decision Making Human-in-the-Loop Systems | Mar 7, 2023 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction | Mar 3, 2017 | Decision Making, Dependency Parsing | —Unverified | 0 |
| Automating Predictive Modeling Process using Reinforcement Learning | Mar 2, 2019 | Decision Making, Decision Making Under Uncertainty | —Unverified | 0 |
| Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications | Dec 31, 2018 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games | Apr 24, 2016 | Atari Games, Decision Making | —Unverified | 0 |
| Deep Reinforcement Learning for Optimal Critical Care Pain Management with Morphine using Dueling Double-Deep Q Networks | Apr 25, 2019 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Portfolio Optimization using Latent Feature State Space (LFSS) Module | Feb 11, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Robust Goal-Based Wealth Management | Jul 25, 2023 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Autonomous Charging of Electric Vehicle Fleets to Enhance Renewable Generation Dispatchability | Dec 22, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Selective Network Discovery via Deep Reinforcement Learning on Embedded Spaces | Sep 16, 2019 | Attribute, Decision Making | —Unverified | 0 |
| Autonomous Tree-search Ability of Large Language Models | Oct 14, 2023 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Deep Reinforcement Learning for Visual Object Tracking in Videos | Jan 31, 2017 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| AutoGuide: Automated Generation and Selection of Context-Aware Guidelines for Large Language Model Agents | Mar 13, 2024 | Decision Making, In-Context Learning | —Unverified | 0 |
| Deep Robust Kalman Filter | Mar 7, 2017 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems | May 2, 2025 | Decision Making, Prediction Intervals | —Unverified | 0 |
| Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework | Aug 3, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Delay and Cooperation in Nonstochastic Linear Bandits | Dec 1, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Delayed Feedback in Generalised Linear Bandits Revisited | Jul 21, 2022 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Delays in Reinforcement Learning | Sep 20, 2023 | Decision Making, reinforcement-learning | —Unverified | 0 |
| AVID: Adapting Video Diffusion Models to World Models | Oct 1, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts | Jan 1, 2025 | Clustering, Online Clustering | —Unverified | 0 |
| Demystify Painting with RL | Dec 14, 2020 | Decision Making, Reinforcement Learning (RL) | —Unverified | 0 |
| Bandit based centralized matching in two-sided markets for peer to peer lending | May 6, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Design of intentional backdoors in sequential models | Feb 26, 2019 | Decision Making, Reinforcement Learning | —Unverified | 0 |
| Bandit Convex Optimization in Non-stationary Environments | Jul 29, 2019 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management | Feb 6, 2021 | Decision Making, Management | —Unverified | 0 |
| Differentiable Quantum Architecture Search in Asynchronous Quantum Reinforcement Learning | Jul 25, 2024 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games | Mar 8, 2021 | counterfactual, Decision Making | —Unverified | 0 |
| Bandits in Matching Markets: Ideas and Proposals for Peer Lending | Oct 30, 2020 | Decision Making, Fairness | —Unverified | 0 |
| Digital Twins for forecasting and decision optimisation with machine learning: applications in wastewater treatment | Apr 23, 2024 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Dimension-Free Rates for Natural Policy Gradient in Multi-Agent Reinforcement Learning | Sep 23, 2021 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 |
| DIP-RL: Demonstration-Inferred Preference Learning in Minecraft | Jul 22, 2023 | Decision Making, Minecraft | —Unverified | 0 |
| Direct and indirect reinforcement learning | Dec 23, 2019 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | Apr 20, 2021 | Clustering, Decision Making | —Unverified | 0 |
| Batched Neural Bandits | Feb 25, 2021 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| A Computational Framework for Motor Skill Acquisition | Jan 3, 2019 | Decision Making, Reinforcement Learning | —Unverified | 0 |
| Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | Dec 1, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Distributed Learning: Sequential Decision Making in Resource-Constrained Environments | Apr 13, 2020 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | Mar 16, 2024 | Decision Making, Edge-computing | —Unverified | 0 |
| Distributed Online Learning in Social Recommender Systems | Sep 26, 2013 | Decision Making, Recommendation Systems | —Unverified | 0 |
| Distributed Optimization via Kernelized Multi-armed Bandits | Dec 7, 2023 | Decision Making, Distributed Optimization | —Unverified | 0 |
| Distributional Robustness and Regularization in Reinforcement Learning | Mar 5, 2020 | Decision Making, reinforcement-learning | —Unverified | 0 |
| Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning | Apr 23, 2020 | continuous-control, Continuous Control | —Unverified | 0 |
| Divide-and-Conquer Monte Carlo Tree Search | Jan 1, 2021 | continuous-control, Continuous Control | —Unverified | 0 |
| Deep Bayesian Estimation for Dynamic Treatment Regimes with a Long Follow-up Time | Sep 20, 2021 | Decision Making, regression | —Unverified | 0 |
| Algorithms for CVaR Optimization in MDPs | Jun 12, 2014 | Decision Making, Sequential Decision Making | —Unverified | 0 |
| Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving | Nov 22, 2022 | Autonomous Driving, Decision Making | —Unverified | 0 |
| A Law of Iterated Logarithm for Multi-Agent Reinforcement Learning | Oct 27, 2021 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 |