| Pragmatic Implementation of Reinforcement Algorithms For Path Finding On Raspberry Pi | Dec 7, 2021 | Collision AvoidanceQ-Learning | —Unverified | 0 |
| Predicting the Need for Blood Transfusion in Intensive Care Units with Reinforcement Learning | Jun 26, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| Predictive Crypto-Asset Automated Market Making Architecture for Decentralized Finance using Deep Reinforcement Learning | Sep 28, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Prelimit Coupling and Steady-State Convergence of Constant-stepsize Nonsmooth Contractive SA | Apr 9, 2024 | Q-Learning | —Unverified | 0 |
| Preventing Value Function Collapse in Ensemble Q-Learning by Maximizing Representation Diversity | Jan 1, 2021 | DiversityQ-Learning | —Unverified | 0 |
| Principal-Agent Reinforcement Learning: Orchestrating AI Agents with Contracts | Jul 25, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays | Feb 15, 2018 | HippocampusQ-Learning | —Unverified | 0 |
| Privacy-Cost Management in Smart Meters with Mutual Information-Based Reinforcement Learning | Jun 10, 2020 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Privacy-Cost Management in Smart Meters Using Deep Reinforcement Learning | Mar 10, 2020 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Probabilistic Curriculum Learning for Goal-Based Reinforcement Learning | Apr 2, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning | Nov 25, 2023 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Projection Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning | Jan 15, 2025 | D4RLQ-Learning | —Unverified | 0 |
| Projective simulation for classical learning agents: a comprehensive investigation | May 7, 2013 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Prospect-theoretic Q-learning | Apr 12, 2021 | Q-Learning | —Unverified | 0 |
| Prospect Theory-inspired Automated P2P Energy Trading with Q-learning-based Dynamic Pricing | Aug 26, 2022 | energy tradingQ-Learning | —Unverified | 0 |
| Protein Structure Prediction in the 3D HP Model Using Deep Reinforcement Learning | Dec 29, 2024 | Deep Reinforcement LearningProtein Structure Prediction | —Unverified | 0 |
| Provable Multi-Objective Reinforcement Learning with Generative Models | Nov 19, 2020 | Multi-Objective Reinforcement LearningQ-Learning | —Unverified | 0 |
| Provable Reinforcement Learning for Networked Control Systems with Stochastic Packet Disordering | Dec 5, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation | Feb 25, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| Provably Efficient Kernelized Q-Learning | Apr 21, 2022 | Q-Learning | —Unverified | 0 |
| Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication | Oct 14, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle | Jun 14, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle | Dec 1, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Provably Efficient Q-Learning with Low Switching Cost | May 30, 2019 | Q-Learning | —Unverified | 0 |
| Provably Efficient Reinforcement Learning with Aggregated States | Dec 13, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games | Oct 12, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Provably More Efficient Q-Learning in the One-Sided-Feedback/Full-Feedback Settings | Jun 30, 2020 | Q-Learning | —Unverified | 0 |
| Direct Data-Driven Discrete-time Bilinear Biquadratic Regulator | Aug 29, 2022 | Q-Learning | —Unverified | 0 |
| Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care | Jun 13, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| Pseudorehearsal in value function approximation | Mar 21, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Learned Collusion | Apr 25, 2023 | Q-Learning | —Unverified | 0 |
| Q-Cogni: An Integrated Causal Reinforcement Learning Framework | Feb 26, 2023 | Causal InferenceDecision Making | —Unverified | 0 |
| Q-CP: Learning Action Values for Cooperative Planning | Mar 1, 2018 | Model-based Reinforcement LearningQ-Learning | —Unverified | 0 |
| Q-DATA: Enhanced Traffic Flow Monitoring in Software-Defined Networks applying Q-learning | Sep 4, 2019 | ManagementQ-Learning | —Unverified | 0 |
| QF-tuner: Breaking Tradition in Reinforcement Learning | Feb 26, 2024 | OpenAI GymQ-Learning | —Unverified | 0 |
| Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning | Jul 15, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Q-greedyUCB: a New Exploration Policy for Adaptive and Resource-efficient Scheduling | Jun 10, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| Efficient Off-Policy Reinforcement Learning via Brain-Inspired Computing | May 14, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| QLAMMP: A Q-Learning Agent for Optimizing Fees on Automated Market Making Protocols | Nov 28, 2022 | Q-Learning | —Unverified | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Q-Learning Algorithm for VoLTE Closed-Loop Power Control in Indoor Small Cells | Jul 10, 2017 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Q-learning as a monotone scheme | May 30, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Q-learning Assisted Energy-Aware Traffic Offloading and Cell Switching in Heterogeneous Networks | Sep 11, 2019 | Q-Learning | —Unverified | 0 |
| Q-Learning Based Aerial Base Station Placement for Fairness Enhancement in Mobile Networks | Sep 10, 2019 | FairnessQ-Learning | —Unverified | 0 |
| Q-learning-based Hierarchical Cooperative Local Search for Steelmaking-continuous Casting Scheduling Problem | Jun 10, 2025 | Q-LearningScheduling | —Unverified | 0 |
| Q-learning-based Model-free Safety Filter | Nov 29, 2024 | modelQ-Learning | —Unverified | 0 |
| Q-learning Based Optimal False Data Injection Attack on Probabilistic Boolean Control Networks | Nov 29, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Q-Learning based system for path planning with unmanned aerial vehicles swarms in obstacle environments | Mar 30, 2023 | Q-Learning | —Unverified | 0 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RLOffline RL | —Unverified | 0 |
| A Distributed Intelligence Architecture for B5G Network Automation | Jul 28, 2021 | ManagementQ-Learning | —Unverified | 0 |