| Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning | Nov 13, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| OCMDP: Observation-Constrained Markov Decision Process | Nov 11, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| OFDM-Based Digital Semantic Communication with Importance Awareness | Jan 4, 2024 | Deep Reinforcement LearningSemantic Communication | —Unverified | 0 |
| Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit | Mar 6, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Offline Imitation Learning Through Graph Search and Retrieval | Jul 22, 2024 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Offline reinforcement learning for job-shop scheduling problems | Oct 21, 2024 | Combinatorial OptimizationDeep Learning | —Unverified | 0 |
| Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning | Feb 14, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks | Dec 11, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift | Jan 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Off-Policy Evaluation via Off-Policy Classification | Jun 4, 2019 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift | Nov 16, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Off-Policy Reinforcement Learning with Delayed Rewards | Jun 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error | Dec 26, 2022 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| OIDM: An Observability-based Intelligent Distributed Edge Sensing Method for Industrial Cyber-Physical Systems | Sep 13, 2024 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| OmniDRL: Robust Pedestrian Detection using Deep Reinforcement Learning on Omnidirectional Cameras | Mar 2, 2019 | Deep Reinforcement LearningPedestrian Detection | —Unverified | 0 |
| On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | Jun 4, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| On Connections between Constrained Optimization and Reinforcement Learning | Oct 18, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| On-Demand Model and Client Deployment in Federated Learning with Deep Reinforcement Learning | May 12, 2024 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 |
| On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning | Dec 13, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| On Double Descent in Reinforcement Learning with LSTD and Random Features | Oct 9, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| One for Many: Transfer Learning for Building HVAC Control | Aug 9, 2020 | Deep Reinforcement LearningTransfer Learning | —Unverified | 0 |
| One is More: Diverse Perspectives within a Single Network for Efficient DRL | Oct 21, 2023 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| One-shot, Offline and Production-Scalable PID Optimisation with Deep Reinforcement Learning | Oct 25, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Reducing Learning Difficulties: One-Step Two-Critic Deep Reinforcement Learning for Inverter-based Volt-Var Control | Mar 30, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| On Improving Deep Reinforcement Learning for POMDPs | Apr 17, 2018 | Atari GamesDecision Making | —Unverified | 0 |