| Optimal sequential decision making with probabilistic digital twins | Mar 12, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making | Sep 29, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Optimization of anemia treatment in hemodialysis patients via reinforcement learning | Sep 14, 2015 | Decision MakingQ-Learning | —Unverified | 0 |
| Optimizing Fantasy Sports Team Selection with Deep Reinforcement Learning | Dec 26, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Optimizing Memory Mapping Using Deep Reinforcement Learning | May 11, 2023 | Cloud ComputingDecision Making | —Unverified | 0 |
| Optimizing Sensor Redundancy in Sequential Decision-Making Problems | Dec 10, 2024 | Decision MakingOpenAI Gym | —Unverified | 0 |
| Out-of-Distribution Adaptation in Offline RL: Counterfactual Reasoning via Causal Normalizing Flows | May 6, 2024 | Causal Inferencecounterfactual | —Unverified | 0 |
| PAC Reinforcement Learning with Rich Observations | Feb 8, 2016 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework | Jun 17, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| Pareto Inverse Reinforcement Learning for Diverse Expert Policy Generation | Aug 22, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| Partial-Adaptive Submodular Maximization | Nov 1, 2021 | Active LearningDecision Making | —Unverified | 0 |
| Partially Observable Stochastic Games with Neural Perception Mechanisms | Oct 17, 2023 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Partial-Monotone Adaptive Submodular Maximization | Jul 26, 2022 | Active LearningDecision Making | —Unverified | 0 |
| Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams | Oct 2, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Patterns, predictions, and actions: A story about machine learning | Feb 10, 2021 | BIG-bench Machine LearningCausal Inference | —Unverified | 0 |
| PDQN - A Deep Reinforcement Learning Method for Planning with Long Delays: Optimization of Manufacturing Dispatching | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Pessimistic Model Selection for Offline Deep Reinforcement Learning | Nov 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Planning with General Objective Functions: Going Beyond Total Rewards | Dec 1, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Playing against Nature: causal discovery for decision making under uncertainty | Jul 3, 2018 | Causal DiscoveryDecision Making | —Unverified | 0 |
| POLAR: A Pessimistic Model-based Policy Learning Algorithm for Dynamic Treatment Regimes | Jun 25, 2025 | Sequential Decision Making | —Unverified | 0 |
| Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment | Oct 3, 2020 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Policy Gradient With Value Function Approximation For Collective Multiagent Planning | Apr 9, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Policy-labeled Preference Learning: Is Preference Enough for RLHF? | May 6, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning | Mar 2, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs | Apr 15, 2025 | Autonomous VehiclesDecision Making | —Unverified | 0 |