| Optimal Matrix Momentum Stochastic Approximation and Applications to Q-learning | Sep 17, 2018 | Q-Learning, Reinforcement Learning | —Unverified | 0 |
| Optimal Path Planning and Cost Minimization for a Drone Delivery System Via Model Predictive Control | Mar 25, 2025 | Model Predictive Control, Multi-agent Reinforcement Learning | —Unverified | 0 |
| Optimal Transport-Assisted Risk-Sensitive Q-Learning | Jun 17, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Optimal Use of Experience in First Person Shooter Environments | Jun 24, 2019 | Q-Learning, Reinforcement Learning | —Unverified | 0 |
| Optimal variance-reduced stochastic approximation in Banach spaces | Jan 21, 2022 | Q-Learning | —Unverified | 0 |
| Optimistic Exploration with Backward Bootstrapped Bonus for Deep Reinforcement Learning | Jan 1, 2021 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 |
| Optimistic Q-learning for average reward and episodic reinforcement learning | Jul 18, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Optimization of anemia treatment in hemodialysis patients via reinforcement learning | Sep 14, 2015 | Decision Making, Q-Learning | —Unverified | 0 |
| Optimized Monte Carlo Tree Search for Enhanced Decision Making in the FrozenLake Environment | Sep 25, 2024 | Decision Making, Q-Learning | —Unverified | 0 |
| Optimized Resource Allocation for Cloud-Native 6G Networks: Zero-Touch ML Models in Microservices-based VNF Deployments | Oct 9, 2024 | Management, Q-Learning | —Unverified | 0 |
| Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning | Jun 27, 2023 | Decision Making, Q-Learning | —Unverified | 0 |
| Optimizing Load Scheduling in Power Grids Using Reinforcement Learning and Markov Decision Processes | Oct 23, 2024 | Management, Q-Learning | —Unverified | 0 |
| Optimizing Returns Using the Hurst Exponent and Q Learning on Momentum and Mean Reversion Strategies | May 23, 2022 | Q-Learning, Time Series | —Unverified | 0 |
| Optimizing TD3 for 7-DOF Robotic Arm Grasping: Overcoming Suboptimality with Exploration-Enhanced Contrastive Learning | Aug 26, 2024 | Contrastive Learning, Q-Learning | —Unverified | 0 |
| Optimizing the Long-Term Behaviour of Deep Reinforcement Learning for Pushing and Grasping | Apr 7, 2022 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Optimizing Wireless Resource Management and Synchronization in Digital Twin Networks | Feb 7, 2025 | Management, Q-Learning | —Unverified | 0 |
| ORIENT: A Priority-Aware Energy-Efficient Approach for Latency-Sensitive Applications in 6G | Feb 10, 2024 | Q-Learning | —Unverified | 0 |
| Overcoming the Curse of Dimensionality in Reinforcement Learning Through Approximate Factorization | Nov 12, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| PAC Reinforcement Learning Algorithm for General-Sum Markov Games | Sep 5, 2020 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| PAIL: Performance based Adversarial Imitation Learning Engine for Carbon Neutral Optimization | Jul 12, 2024 | Deep Reinforcement Learning, Imitation Learning | —Unverified | 0 |
| PALMER: Perception-Action Loop with Memory for Long-Horizon Planning | Dec 8, 2022 | Q-Learning, Representation Learning | —Unverified | 0 |
| Parallel bandit architecture based on laser chaos for reinforcement learning | May 19, 2022 | Decision Making, Q-Learning | —Unverified | 0 |
| Parameterized MDPs and Reinforcement Learning Problems -- A Maximum Entropy Principle Based Framework | Jun 17, 2020 | Decision Making, Q-Learning | —Unverified | 0 |
| Parameterized Reinforcement Learning for Optical System Optimization | Oct 9, 2020 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Partial Counterfactual Identification for Infinite Horizon Partially Observable Markov Decision Process | Aug 31, 2022 | counterfactual, Q-Learning | —Unverified | 0 |
| Partially Detected Intelligent Traffic Signal Control: Environmental Adaptation | Oct 23, 2019 | Q-Learning, Reinforcement Learning | —Unverified | 0 |
| Patchwork: A Patch-wise Attention Network for Efficient Object Detection and Segmentation in Video Streams | Apr 3, 2019 | Hard Attention, object-detection | —Unverified | 0 |
| Periodic agent-state based Q-learning for POMDPs | Jul 8, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Periodic Q-Learning | Feb 23, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Personalized Cancer Chemotherapy Schedule: a numerical comparison of performance and robustness in model-based and model-free scheduling methodologies | Apr 2, 2019 | Deep Reinforcement Learning, model | —Unverified | 0 |
| Personalized Dynamic Pricing Policy for Electric Vehicles: Reinforcement learning approach | Jan 1, 2024 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Personalized Medical Treatments Using Novel Reinforcement Learning Algorithms | Jun 16, 2014 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | Feb 28, 2022 | Offline RL, Q-Learning | —Unverified | 0 |
| Photonic architecture for reinforcement learning | Jul 17, 2019 | Active Learning, Q-Learning | —Unverified | 0 |
| Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning | Aug 31, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 |
| PickLLM: Context-Aware RL-Assisted Large Language Model Routing | Dec 12, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| PID Accelerated Temporal Difference Algorithms | Jul 11, 2024 | Q-Learning, Reinforcement Learning (RL) | —Unverified | 0 |
| Planning and Learning in Average Risk-aware MDPs | Mar 22, 2025 | Q-Learning | —Unverified | 0 |
| Planning and Learning with Stochastic Action Sets | May 7, 2018 | Q-Learning, Reinforcement Learning | —Unverified | 0 |
| Planning Irregular Object Packing via Hierarchical Reinforcement Learning | Nov 17, 2022 | Hierarchical Reinforcement Learning, Object | —Unverified | 0 |
| Planning with RL and episodic-memory behavioral priors | Jul 5, 2022 | Imitation Learning, Q-Learning | —Unverified | 0 |
| Playing a 2D Game Indefinitely using NEAT and Reinforcement Learning | Jul 28, 2022 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| Playing against Nature: causal discovery for decision making under uncertainty | Jul 3, 2018 | Causal Discovery, Decision Making | —Unverified | 0 |
| Pointer Networks with Q-Learning for Combinatorial Optimization | Nov 5, 2023 | Combinatorial Optimization, Graph Embedding | —Unverified | 0 |
| Policy Learning with a Natural Language Action Space: A Causal Approach | Feb 24, 2025 | Decision Making, Q-Learning | —Unverified | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement Learning, MuJoCo | —Unverified | 0 |
| Polyphonic Music Composition: An Adversarial Inverse Reinforcement Learning Approach | Sep 29, 2021 | Q-Learning, reinforcement-learning | —Unverified | 0 |
| PooL: Pheromone-inspired Communication Framework for Large Scale Multi-Agent Reinforcement Learning | Feb 20, 2022 | Multi-agent Reinforcement Learning, Q-Learning | —Unverified | 0 |
| Potential-Based Advice for Stochastic Policy Learning | Jul 20, 2019 | Q-Learning, Reinforcement Learning | —Unverified | 0 |
| Potential Impacts of Smart Homes on Human Behavior: A Reinforcement Learning Approach | Feb 26, 2021 | Hierarchical Reinforcement Learning, Q-Learning | —Unverified | 0 |