| Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning | Jun 27, 2023 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Optimizing Load Scheduling in Power Grids Using Reinforcement Learning and Markov Decision Processes | Oct 23, 2024 | ManagementQ-Learning | —Unverified | 0 | 0 |
| Optimizing Returns Using the Hurst Exponent and Q Learning on Momentum and Mean Reversion Strategies | May 23, 2022 | Q-LearningTime Series | —Unverified | 0 | 0 |
| Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning | Aug 26, 2024 | Contrastive LearningQ-Learning | —Unverified | 0 | 0 |
| Optimizing the Long-Term Behaviour of Deep Reinforcement Learning for Pushing and Grasping | Apr 7, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Optimizing Wireless Resource Management and Synchronization in Digital Twin Networks | Feb 7, 2025 | ManagementQ-Learning | —Unverified | 0 | 0 |
| ORIENT: A Priority-Aware Energy-Efficient Approach for Latency-Sensitive Applications in 6G | Feb 10, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization | Nov 12, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| PAC Reinforcement Learning Algorithm for General-Sum Markov Games | Sep 5, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization | Jul 12, 2024 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 | 0 |
| PALMER: Perception-Action Loop with Memory for Long-Horizon Planning | Dec 8, 2022 | Q-LearningRepresentation Learning | —Unverified | 0 | 0 |
| Parallel bandit architecture based on laser chaos for reinforcement learning | May 19, 2022 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework | Jun 17, 2020 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Parameterized Reinforcement Learning for Optical System Optimization | Oct 9, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Partial Counterfactual Identification for Infinite Horizon Partially Observable Markov Decision Process | Aug 31, 2022 | counterfactualQ-Learning | —Unverified | 0 | 0 |
| Partially Detected Intelligent Traffic Signal Control: Environmental Adaptation | Oct 23, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Patchwork: A Patch-wise Attention Network for Efficient Object Detection and Segmentation in Video Streams | Apr 3, 2019 | Hard Attentionobject-detection | —Unverified | 0 | 0 |
| Periodic agent-state based Q-learning for POMDPs | Jul 8, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Periodic Q-Learning | Feb 23, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Personalized Cancer Chemotherapy Schedule: a numerical comparison of performance and robustness in model-based and model-free scheduling methodologies | Apr 2, 2019 | Deep Reinforcement Learningmodel | —Unverified | 0 | 0 |
| Personalized Dynamic Pricing Policy for Electric Vehicles: Reinforcement learning approach | Jan 1, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Personalized Medical Treatments Using Novel Reinforcement Learning Algorithms | Jun 16, 2014 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | Feb 28, 2022 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Photonic architecture for reinforcement learning | Jul 17, 2019 | Active LearningQ-Learning | —Unverified | 0 | 0 |
| Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning | Aug 31, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| PickLLM: Context-Aware RL-Assisted Large Language Model Routing | Dec 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| PID Accelerated Temporal Difference Algorithms | Jul 11, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Planning and Learning in Average Risk-aware MDPs | Mar 22, 2025 | Q-Learning | —Unverified | 0 | 0 |
| Planning and Learning with Stochastic Action Sets | May 7, 2018 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Planning Irregular Object Packing via Hierarchical Reinforcement Learning | Nov 17, 2022 | Hierarchical Reinforcement LearningObject | —Unverified | 0 | 0 |
| Planning with RL and episodic-memory behavioral priors | Jul 5, 2022 | Imitation LearningQ-Learning | —Unverified | 0 | 0 |
| Playing a 2D Game Indefinitely using NEAT and Reinforcement Learning | Jul 28, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Playing against Nature: causal discovery for decision making under uncertainty | Jul 3, 2018 | Causal DiscoveryDecision Making | —Unverified | 0 | 0 |
| Pointer Networks with Q-Learning for Combinatorial Optimization | Nov 5, 2023 | Combinatorial OptimizationGraph Embedding | —Unverified | 0 | 0 |
| Policy Learning with a Natural Language Action Space: A Causal Approach | Feb 24, 2025 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Polyphonic Music Composition: An Adversarial Inverse Reinforcement Learning Approach | Sep 29, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| PooL: Pheromone-inspired Communication Framework forLarge Scale Multi-Agent Reinforcement Learning | Feb 20, 2022 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Potential-Based Advice for Stochastic Policy Learning | Jul 20, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 | 0 |
| Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach | Feb 26, 2021 | Hierarchical Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Pragmatic Implementation of Reinforcement Algorithms For Path Finding On Raspberry Pi | Dec 7, 2021 | Collision AvoidanceQ-Learning | —Unverified | 0 | 0 |
| Predicting the Need for Blood Transfusion in Intensive Care Units with Reinforcement Learning | Jun 26, 2022 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Predictive Crypto-Asset Automated Market Making Architecture for Decentralized Finance using Deep Reinforcement Learning | Sep 28, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive SA | Apr 9, 2024 | Q-Learning | —Unverified | 0 | 0 |
| Preventing Value Function Collapse in Ensemble Q-Learning by Maximizing Representation Diversity | Jan 1, 2021 | DiversityQ-Learning | —Unverified | 0 | 0 |
| Principal-Agent Reinforcement Learning: Orchestrating AI Agents with Contracts | Jul 25, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays | Feb 15, 2018 | HippocampusQ-Learning | —Unverified | 0 | 0 |
| Privacy-Cost Management in Smart Meters with Mutual Information-Based Reinforcement Learning | Jun 10, 2020 | Deep Reinforcement LearningManagement | —Unverified | 0 | 0 |
| Privacy-Cost Management in Smart Meters Using Deep Reinforcement Learning | Mar 10, 2020 | Deep Reinforcement LearningManagement | —Unverified | 0 | 0 |
| Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning | Apr 2, 2025 | continuous-controlContinuous Control | —Unverified | 0 | 0 |