| Observation Space Matters: Benchmark and Optimization Algorithm | Nov 2, 2020 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Observe and Look Further: Achieving Consistent Performance on Atari | May 29, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Observed Adversaries in Deep Reinforcement Learning | Oct 13, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Integrating DeepRL with Robust Low-Level Control in Robotic Manipulators for Non-Repetitive Reaching Tasks | Feb 4, 2024 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning | Nov 13, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| OCMDP: Observation-Constrained Markov Decision Process | Nov 11, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| OFDM-Based Digital Semantic Communication with Importance Awareness | Jan 4, 2024 | Deep Reinforcement LearningSemantic Communication | —Unverified | 0 | 0 |
| Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit | Mar 6, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Offline Imitation Learning Through Graph Search and Retrieval | Jul 22, 2024 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 | 0 |
| Offline reinforcement learning for job-shop scheduling problems | Oct 21, 2024 | Combinatorial OptimizationDeep Learning | —Unverified | 0 | 0 |
| Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning | Feb 14, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks | Dec 11, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift | Jan 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Off-Policy Evaluation via Off-Policy Classification | Jun 4, 2019 | ClassificationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift | Nov 16, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Off-Policy Reinforcement Learning with Delayed Rewards | Jun 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error | Dec 26, 2022 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 | 0 |
| OIDM: An Observability-based Intelligent Distributed Edge Sensing Method for Industrial Cyber-Physical Systems | Sep 13, 2024 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 | 0 |
| OmniDRL: Robust Pedestrian Detection using Deep Reinforcement Learning on Omnidirectional Cameras | Mar 2, 2019 | Deep Reinforcement LearningPedestrian Detection | —Unverified | 0 | 0 |
| On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | Jun 4, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| On Connections between Constrained Optimization and Reinforcement Learning | Oct 18, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| On-Demand Model and Client Deployment in Federated Learning with Deep Reinforcement Learning | May 12, 2024 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 | 0 |
| On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning | Dec 13, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| On Double Descent in Reinforcement Learning with LSTD and Random Features | Oct 9, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| One for Many: Transfer Learning for Building HVAC Control | Aug 9, 2020 | Deep Reinforcement LearningTransfer Learning | —Unverified | 0 | 0 |