| Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning | Nov 13, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| OCMDP: Observation-Constrained Markov Decision Process | Nov 11, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| OFDM-Based Digital Semantic Communication with Importance Awareness | Jan 4, 2024 | Deep Reinforcement LearningSemantic Communication | —Unverified | 0 |
| Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit | Mar 6, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Offline Imitation Learning Through Graph Search and Retrieval | Jul 22, 2024 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Offline reinforcement learning for job-shop scheduling problems | Oct 21, 2024 | Combinatorial OptimizationDeep Learning | —Unverified | 0 |
| Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning | Feb 14, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks | Dec 11, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift | Jan 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Off-Policy Evaluation via Off-Policy Classification | Jun 4, 2019 | ClassificationDeep Reinforcement Learning | —Unverified | 0 |
| Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift | Nov 16, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Off-Policy Reinforcement Learning with Delayed Rewards | Jun 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error | Dec 26, 2022 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| OIDM: An Observability-based Intelligent Distributed Edge Sensing Method for Industrial Cyber-Physical Systems | Sep 13, 2024 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |
| OmniDRL: Robust Pedestrian Detection using Deep Reinforcement Learning on Omnidirectional Cameras | Mar 2, 2019 | Deep Reinforcement LearningPedestrian Detection | —Unverified | 0 |
| On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | Jun 4, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| On Connections between Constrained Optimization and Reinforcement Learning | Oct 18, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| On-Demand Model and Client Deployment in Federated Learning with Deep Reinforcement Learning | May 12, 2024 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 |
| On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning | Dec 13, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| On Double Descent in Reinforcement Learning with LSTD and Random Features | Oct 9, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| One for Many: Transfer Learning for Building HVAC Control | Aug 9, 2020 | Deep Reinforcement LearningTransfer Learning | —Unverified | 0 |
| One is More: Diverse Perspectives within a Single Network for Efficient DRL | Oct 21, 2023 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| One-shot, Offline and Production-Scalable PID Optimisation with Deep Reinforcement Learning | Oct 25, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Reducing Learning Difficulties: One-Step Two-Critic Deep Reinforcement Learning for Inverter-based Volt-Var Control | Mar 30, 2022 | Deep Reinforcement Learning | —Unverified | 0 |
| On Improving Deep Reinforcement Learning for POMDPs | Apr 17, 2018 | Atari GamesDecision Making | —Unverified | 0 |
| On Inductive Biases in Deep Reinforcement Learning | Jul 5, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| On Jointly Optimizing Partial Offloading and SFC Mapping: A Cooperative Dual-agent Deep Reinforcement Learning Approach | May 20, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Online Antenna Tuning in Heterogeneous Cellular Networks with Deep Reinforcement Learning | Mar 15, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| On-line Building Energy Optimization using Deep Reinforcement Learning | Jul 18, 2017 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Online Contrastive Divergence with Generative Replay: Experience Replay without Storing Data | Oct 18, 2016 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Online Data Poisoning Attack | Mar 5, 2019 | Data PoisoningDeep Reinforcement Learning | —Unverified | 0 |
| Online Data Poisoning Attacks | Jun 8, 2020 | Data PoisoningDeep Reinforcement Learning | —Unverified | 0 |
| Online Deep Reinforcement Learning for Autonomous UAV Navigation and Exploration of Outdoor Environments | Dec 11, 2019 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| Online Meta-learning by Parallel Algorithm Competition | Feb 24, 2017 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Online Model Selection for Reinforcement Learning with Function Approximation | Nov 19, 2020 | Deep Reinforcement LearningModel Selection | —Unverified | 0 |
| Online Multimodal Transportation Planning using Deep Reinforcement Learning | May 18, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Online Policy Distillation with Decision-Attention | Jun 8, 2024 | Deep Reinforcement LearningKnowledge Distillation | —Unverified | 0 |
| Online Robustness Training for Deep Reinforcement Learning | Nov 3, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Online Robust Policy Learning in the Presence of Unknown Adversaries | Jul 16, 2018 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| Online Safety Assurance for Deep Reinforcement Learning | Oct 7, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Online Safety Property Collection and Refinement for Safe Deep Reinforcement Learning in Mapless Navigation | Feb 13, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Online Service Provisioning in NFV-enabled Networks Using Deep Reinforcement Learning | Nov 3, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Online Task Scheduling for Fog Computing with Multi-Resource Fairness | Aug 1, 2020 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Online Trading Models with Deep Reinforcement Learning in the Forex Market Considering Transaction Costs | Jun 6, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning | May 4, 2021 | Behavioural cloningDeep Reinforcement Learning | —Unverified | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 |
| On Neural Consolidation for Transfer in Reinforcement Learning | Oct 5, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| On-Policy Deep Reinforcement Learning for the Average-Reward Criterion | Jun 14, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| On Reducing Undesirable Behavior in Deep Reinforcement Learning Models | Sep 6, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| On Reward Shaping for Mobile Robot Navigation: A Reinforcement Learning and SLAM Based Approach | Feb 10, 2020 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |