| Off-Policy Reinforcement Learning with Delayed Rewards | Jun 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error | Dec 26, 2022 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 | 0 |
| OIDM: An Observability-based Intelligent Distributed Edge Sensing Method for Industrial Cyber-Physical Systems | Sep 13, 2024 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 | 0 |
| OmniDRL: Robust Pedestrian Detection using Deep Reinforcement Learning on Omnidirectional Cameras | Mar 2, 2019 | Deep Reinforcement LearningPedestrian Detection | —Unverified | 0 | 0 |
| On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | Jun 4, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| On Connections between Constrained Optimization and Reinforcement Learning | Oct 18, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| On-Demand Model and Client Deployment in Federated Learning with Deep Reinforcement Learning | May 12, 2024 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 | 0 |
| On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning | Dec 13, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| On Double Descent in Reinforcement Learning with LSTD and Random Features | Oct 9, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| One for Many: Transfer Learning for Building HVAC Control | Aug 9, 2020 | Deep Reinforcement LearningTransfer Learning | —Unverified | 0 | 0 |