| Policy Gradient with Expected Quadratic Utility Maximization: A New Mean-Variance Approach in Reinforcement Learning | Sep 28, 2020 | Decision Making, Management | Unverified | 0 |
| Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment | Oct 3, 2020 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Policy Gradient With Serial Markov Chain Reasoning | Oct 13, 2022 | Decision Making, MuJoCo | Unverified | 0 |
| Policy Gradient With Value Function Approximation For Collective Multiagent Planning | Apr 9, 2018 | Decision Making, Reinforcement Learning | Unverified | 0 |
| Policy-labeled Preference Learning: Is Preference Enough for RLHF? | May 6, 2025 | continuous-control, Continuous Control | Unverified | 0 |
| Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System | Oct 1, 2014 | Decision Making, Dialogue Management | Unverified | 0 |
| Policy Learning with a Natural Language Action Space: A Causal Approach | Feb 24, 2025 | Decision Making, Q-Learning | Unverified | 0 |
| Policy Learning with Asymmetric Counterfactual Utilities | Jun 21, 2022 | counterfactual, Decision Making | Unverified | 0 |
| Policy Optimization Using Semi-parametric Models for Dynamic Pricing | Sep 13, 2021 | Decision Making | Unverified | 0 |
| Policy Optimization with Model-based Explorations | Nov 18, 2018 | Atari Games, Decision Making | Unverified | 0 |
| Policy Regularization for Legible Behavior | Mar 8, 2022 | Decision Making | Unverified | 0 |
| Policy Trees for Prediction: Interpretable and Adaptive Model Selection for Machine Learning | May 30, 2024 | Decision Making, Model Selection | Unverified | 0 |
| Polynomial Regret Concentration of UCB for Non-Deterministic State Transitions | Feb 9, 2025 | Decision Making | Unverified | 0 |
| POMDPs in Continuous Time and Discrete Spaces | Oct 2, 2020 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| POPPINS : A Population-Based Digital Spiking Neuromorphic Processor with Integer Quadratic Integrate-and-Fire Neurons | Jan 19, 2022 | Decision Making | Unverified | 0 |
| Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning | Mar 2, 2023 | Decision Making, Language Modeling | Unverified | 0 |
| Population stratification enables modeling effects of reopening policies on mortality and hospitalization rates | Aug 10, 2020 | counterfactual, Decision Making | Unverified | 0 |
| Portfolio optimization with two coherent risk measures | Mar 25, 2019 | Decision Making, Portfolio Optimization | Unverified | 0 |
| Portfolio Selection via Topological Data Analysis | Aug 15, 2023 | Clustering, Decision Making | Unverified | 0 |
| Pose-based Tremor Classification for Parkinson's Disease Diagnosis from Video | Jul 14, 2022 | Decision Making, Diagnostic | Unverified | 0 |
| Position: Bayesian Statistics Facilitates Stakeholder Participation in Evaluation of Generative AI | Apr 21, 2025 | Bayesian Inference, Decision Making | Unverified | 0 |
| Position: Emergent Machina Sapiens Urge Rethinking Multi-Agent Paradigms | Feb 5, 2025 | Decision Making, Position | Unverified | 0 |
| Position: Empowering Time Series Reasoning with Multimodal LLMs | Feb 3, 2025 | Decision Making, Multimodal Reasoning | Unverified | 0 |
| Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability | Nov 7, 2024 | Decision Making, Diagnostic | Unverified | 0 |
| Position Paper: Rethinking Privacy in RL for Sequential Decision-making in the Age of LLMs | Apr 15, 2025 | Autonomous Vehicles, Decision Making | Unverified | 0 |