| Patterns, predictions, and actions: A story about machine learning | Feb 10, 2021 | BIG-bench Machine LearningCausal Inference | —Unverified | 0 | 0 |
| PDQN - A Deep Reinforcement Learning Method for Planning with Long Delays: Optimization of Manufacturing Dispatching | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Pessimistic Model Selection for Offline Deep Reinforcement Learning | Nov 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Planning with General Objective Functions: Going Beyond Total Rewards | Dec 1, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Playing against Nature: causal discovery for decision making under uncertainty | Jul 3, 2018 | Causal DiscoveryDecision Making | —Unverified | 0 | 0 |
| POLAR: A Pessimistic Model-based Policy Learning Algorithm for Dynamic Treatment Regimes | Jun 25, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |
| Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment | Oct 3, 2020 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Policy Gradient With Value Function Approximation For Collective Multiagent Planning | Apr 9, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 | 0 |
| Policy-labeled Preference Learning: Is Preference Enough for RLHF? | May 6, 2025 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning | Mar 2, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 | 0 |
| Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs | Apr 15, 2025 | Autonomous VehiclesDecision Making | —Unverified | 0 | 0 |
| Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning | Jul 12, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Predicting Periodicity with Temporal Difference Learning | Sep 20, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 | 0 |
| Learning-to-defer for sequential medical decision-making under uncertainty | Sep 13, 2021 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Preference at First Sight | Jun 24, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Preference Optimization as Probabilistic Inference | Oct 5, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Reference Points, Risk-Taking Behavior, and Competitive Outcomes in Sequential Settings | Sep 20, 2024 | counterfactualDecision Making | —Unverified | 0 | 0 |
| Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients | Dec 30, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Probabilistic DAG Search | Jun 16, 2021 | Decision Makingfeature selection | —Unverified | 0 | 0 |
| Probability Tools for Sequential Random Projection | Feb 16, 2024 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Proportional Aggregation of Preferences for Sequential Decision Making | Jun 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes | Oct 20, 2023 | Decision MakingMulti-Task Learning | —Unverified | 0 | 0 |
| Provable Reinforcement Learning with a Short-Term Memory | Feb 8, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| PROVABLY BENEFITS OF DEEP HIERARCHICAL RL | Sep 25, 2019 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization | Feb 14, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |