| Patterns, predictions, and actions: A story about machine learning | Feb 10, 2021 | BIG-bench Machine LearningCausal Inference | —Unverified | 0 | 0 |
| PDQN - A Deep Reinforcement Learning Method for Planning with Long Delays: Optimization of Manufacturing Dispatching | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Pessimistic Model Selection for Offline Deep Reinforcement Learning | Nov 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Planning with General Objective Functions: Going Beyond Total Rewards | Dec 1, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Playing against Nature: causal discovery for decision making under uncertainty | Jul 3, 2018 | Causal DiscoveryDecision Making | —Unverified | 0 | 0 |
| POLAR: A Pessimistic Model-based Policy Learning Algorithm for Dynamic Treatment Regimes | Jun 25, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |
| Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment | Oct 3, 2020 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Policy Gradient With Value Function Approximation For Collective Multiagent Planning | Apr 9, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 | 0 |
| Policy-labeled Preference Learning: Is Preference Enough for RLHF? | May 6, 2025 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning | Mar 2, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 | 0 |
| Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs | Apr 15, 2025 | Autonomous VehiclesDecision Making | —Unverified | 0 | 0 |
| Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning | Jul 12, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Predicting Periodicity with Temporal Difference Learning | Sep 20, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 | 0 |
| Learning-to-defer for sequential medical decision-making under uncertainty | Sep 13, 2021 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Preference at First Sight | Jun 24, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Preference Optimization as Probabilistic Inference | Oct 5, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Reference Points, Risk-Taking Behavior, and Competitive Outcomes in Sequential Settings | Sep 20, 2024 | counterfactualDecision Making | —Unverified | 0 | 0 |
| Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients | Dec 30, 2020 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Probabilistic DAG Search | Jun 16, 2021 | Decision Makingfeature selection | —Unverified | 0 | 0 |
| Probability Tools for Sequential Random Projection | Feb 16, 2024 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Proportional Aggregation of Preferences for Sequential Decision Making | Jun 26, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes | Oct 20, 2023 | Decision MakingMulti-Task Learning | —Unverified | 0 | 0 |
| Provable Reinforcement Learning with a Short-Term Memory | Feb 8, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| PROVABLY BENEFITS OF DEEP HIERARCHICAL RL | Sep 25, 2019 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization | Feb 14, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Provably Efficient Reinforcement Learning for Adversarial Restless Multi-Armed Bandits with Unknown Transitions and Bandit Feedback | May 2, 2024 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 | 0 |
| Provably Efficient UCB-type Algorithms For Learning Predictive State Representations | Jul 1, 2023 | Computational EfficiencyDecision Making | —Unverified | 0 | 0 |
| Provably Learning Nash Policies in Constrained Markov Potential Games | Jun 13, 2023 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces | May 26, 2014 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Pure Exploration under Mediators' Feedback | Aug 29, 2023 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| QForce-RL: Quantized FPGA-Optimized Reinforcement Learning Compute Engine | Jun 8, 2025 | Decision MakingQuantization | —Unverified | 0 | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Q-learning with temporal memory to navigate turbulence | Apr 26, 2024 | Decision MakingNavigate | —Unverified | 0 | 0 |
| Quantile Off-Policy Evaluation via Deep Conditional Generative Learning | Dec 29, 2022 | Decision MakingOff-policy evaluation | —Unverified | 0 | 0 |
| Quantum deep recurrent reinforcement learning | Oct 26, 2022 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness | Oct 28, 2024 | Computational EfficiencyDecision Making | —Unverified | 0 | 0 |
| RAISE: Reinforenced Adaptive Instruction Selection For Large Language Models | Apr 9, 2025 | Sequential Decision Making | —Unverified | 0 | 0 |
| Real-Time Web Scale Event Summarization Using Sequential Decision Making | May 12, 2016 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Reasoning in visual navigation of end-to-end trained agents: a dynamical systems approach | Mar 11, 2025 | NavigateSequential Decision Making | —Unverified | 0 | 0 |
| Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks | Jul 13, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Rectifying Reinforcement Learning for Reward Matching | Jun 4, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions | Dec 26, 2021 | Decision MakingGeneral Reinforcement Learning | —Unverified | 0 | 0 |
| Regret and Cumulative Constraint Violation Analysis for Distributed Online Constrained Convex Optimization | May 1, 2021 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Regret Bounds for Online Portfolio Selection with a Cardinality Constraint | Dec 1, 2018 | Computational EfficiencyDecision Making | —Unverified | 0 | 0 |
| Regret-Minimization Algorithms for Multi-Agent Cooperative Learning Systems | Oct 30, 2023 | Cloud ComputingDecision Making | —Unverified | 0 | 0 |
| Regret Minimization via Saddle Point Optimization | Mar 15, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 | 0 |
| Regular Decision Processes for Grid Worlds | Nov 5, 2021 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Regularized Conditional Diffusion Model for Multi-Task Preference Alignment | Apr 7, 2024 | D4RLDecision Making | —Unverified | 0 | 0 |
| Regularized Q-Learning with Linear Function Approximation | Jan 26, 2024 | Decision Making Under UncertaintyQ-Learning | —Unverified | 0 | 0 |
| Reinforced Approximate Exploratory Data Analysis | Dec 12, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |