| Play with Emotion: Affect-Driven Reinforcement Learning | Aug 26, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Plex: Towards Reliability using Pretrained Large Model Extensions | Jul 15, 2022 | Active LearningDecision Making | —Unverified | 0 | 0 |
| POETREE: Interpretable Policy Learning with Adaptive Decision Trees | Mar 15, 2022 | Decision Making | —Unverified | 0 | 0 |
| POI Alias Discovery in Delivery Addresses using User Locations | Sep 20, 2021 | Decision Making | —Unverified | 0 | 0 |
| Point-Based Value Iteration for POMDPs with Neural Perception Mechanisms | Jun 30, 2023 | Collision AvoidanceDecision Making | —Unverified | 0 | 0 |
| PolarNet: Accelerated Deep Open Space Segmentation Using Automotive Radar in Polar Domain | Mar 4, 2021 | Autonomous DrivingDecision Making | —Unverified | 0 | 0 |
| Polar-Net: A Clinical-Friendly Model for Alzheimer's Disease Detection in OCTA Images | Nov 10, 2023 | Alzheimer's Disease DetectionDecision Making | —Unverified | 0 | 0 |
| Policy Choice and Best Arm Identification: Asymptotic Analysis of Exploration Sampling | Sep 16, 2021 | Decision MakingExperimental Design | —Unverified | 0 | 0 |
| Policy consequences of the new neuroeconomic framework | Sep 11, 2024 | Decision MakingManagement | —Unverified | 0 | 0 |
| Policy-focused Agent-based Modeling using RL Behavioral Models | Jun 9, 2020 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Policy Gradients for Contextual Recommendations | Feb 12, 2018 | Decision MakingMulti-Armed Bandits | —Unverified | 0 | 0 |
| Policy-Gradient Training of Language Models for Ranking | Oct 6, 2023 | Decision MakingDomain Generalization | —Unverified | 0 | 0 |
| Policy Gradient with Expected Quadratic Utility Maximization: A New Mean-Variance Approach in Reinforcement Learning | Sep 28, 2020 | Decision MakingManagement | —Unverified | 0 | 0 |
| Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment | Oct 3, 2020 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Policy Gradient With Serial Markov Chain Reasoning | Oct 13, 2022 | Decision MakingMuJoCo | —Unverified | 0 | 0 |
| Policy Gradient With Value Function Approximation For Collective Multiagent Planning | Apr 9, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 | 0 |
| Policy-labeled Preference Learning: Is Preference Enough for RLHF? | May 6, 2025 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Policy Learning for Domain Selection in an Extensible Multi-domain Spoken Dialogue System | Oct 1, 2014 | Decision MakingDialogue Management | —Unverified | 0 | 0 |
| Policy Learning with a Natural Language Action Space: A Causal Approach | Feb 24, 2025 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Policy Learning with Asymmetric Counterfactual Utilities | Jun 21, 2022 | counterfactualDecision Making | —Unverified | 0 | 0 |
| Policy Optimization Using Semi-parametric Models for Dynamic Pricing | Sep 13, 2021 | Decision Making | —Unverified | 0 | 0 |
| Policy Optimization with Model-based Explorations | Nov 18, 2018 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| Policy Regularization for Legible Behavior | Mar 8, 2022 | Decision Making | —Unverified | 0 | 0 |
| Policy Trees for Prediction: Interpretable and Adaptive Model Selection for Machine Learning | May 30, 2024 | Decision MakingModel Selection | —Unverified | 0 | 0 |
| Polynomial Regret Concentration of UCB for Non-Deterministic State Transitions | Feb 9, 2025 | Decision Making | —Unverified | 0 | 0 |