| Neural-to-Tree Policy Distillation with Policy Improvement Criterion | Aug 16, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy | Dec 1, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| Neuro-evolutionary Frameworks for Generalized Learning Agents | Feb 4, 2020 | Deep Reinforcement LearningEvolutionary Algorithms | —Unverified | 0 | 0 |
| Neuromechanics-based Deep Reinforcement Learning of Neurostimulation Control in FES cycling | Mar 4, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Neuromodulated Patience for Robot and Self-Driving Vehicle Navigation | Sep 14, 2019 | Deep Reinforcement LearningNavigate | —Unverified | 0 | 0 |
| Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning | Apr 9, 2025 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Neuroplastic Expansion in Deep Reinforcement Learning | Oct 10, 2024 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Neuroprospecting with DeepRL agents | Sep 24, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Neuro-Symbolic Reinforcement Learning with First-Order Logic | Oct 21, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Never Forget: Balancing Exploration and Exploitation via Learning Optical Flow | Jan 24, 2019 | Deep Reinforcement LearningOptical Flow Estimation | —Unverified | 0 | 0 |
| No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees | Aug 23, 2021 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Some approaches used to overcome overestimation in Deep Reinforcement Learning algorithms | Jun 25, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Noisy Spiking Actor Network for Exploration | Mar 7, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Non-Deterministic Policy Improvement Stabilizes Approximated Reinforcement Learning | Dec 22, 2016 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Non-Progressive Influence Maximization in Dynamic Social Networks | Dec 10, 2024 | Deep Reinforcement LearningDynamic graph embedding | —Unverified | 0 | 0 |
| Non-Robust Feature Mapping in Deep Reinforcement Learning | Jun 18, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| No-Regret Reinforcement Learning with Heavy-Tailed Rewards | Feb 25, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Normalization and effective learning rates in reinforcement learning | Jul 1, 2024 | Atari GamesContinual Learning | —Unverified | 0 | 0 |
| NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration | Jun 19, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| NTP-INT: Network Traffic Prediction-Driven In-band Network Telemetry for High-load Switches | Feb 18, 2025 | Deep Reinforcement LearningGraph Neural Network | —Unverified | 0 | 0 |
| Nuclear Microreactor Control with Deep Reinforcement Learning | Mar 31, 2025 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Object Exchangeability in Reinforcement Learning: Extended Abstract | May 7, 2019 | Deep Reinforcement LearningObject | —Unverified | 0 | 0 |
| Object Goal Navigation using Data Regularized Q-Learning | Aug 27, 2022 | Data AugmentationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Object-sensitive Deep Reinforcement Learning | Sep 17, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Objects matter: object-centric world models improve reinforcement learning in visually complex environments | Jan 27, 2025 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Observation Space Matters: Benchmark and Optimization Algorithm | Nov 2, 2020 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Observe and Look Further: Achieving Consistent Performance on Atari | May 29, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Observed Adversaries in Deep Reinforcement Learning | Oct 13, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Integrating DeepRL with Robust Low-Level Control in Robotic Manipulators for Non-Repetitive Reaching Tasks | Feb 4, 2024 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Obstacle Avoidance for UAS in Continuous Action Space Using Deep Reinforcement Learning | Nov 13, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| OCMDP: Observation-Constrained Markov Decision Process | Nov 11, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| OFDM-Based Digital Semantic Communication with Importance Awareness | Jan 4, 2024 | Deep Reinforcement LearningSemantic Communication | —Unverified | 0 | 0 |
| Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit | Mar 6, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Offline Imitation Learning Through Graph Search and Retrieval | Jul 22, 2024 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 | 0 |
| Offline reinforcement learning for job-shop scheduling problems | Oct 21, 2024 | Combinatorial OptimizationDeep Learning | —Unverified | 0 | 0 |
| Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning | Feb 14, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks | Dec 11, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift | Jan 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Off-Policy Evaluation via Off-Policy Classification | Jun 4, 2019 | ClassificationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift | Nov 16, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Off-Policy Reinforcement Learning with Delayed Rewards | Jun 22, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Off-Policy Reinforcement Learning with Loss Function Weighted by Temporal Difference Error | Dec 26, 2022 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 | 0 |
| OIDM: An Observability-based Intelligent Distributed Edge Sensing Method for Industrial Cyber-Physical Systems | Sep 13, 2024 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 | 0 |
| OmniDRL: Robust Pedestrian Detection using Deep Reinforcement Learning on Omnidirectional Cameras | Mar 2, 2019 | Deep Reinforcement LearningPedestrian Detection | —Unverified | 0 | 0 |
| On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | Jun 4, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| On Connections between Constrained Optimization and Reinforcement Learning | Oct 18, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| On-Demand Model and Client Deployment in Federated Learning with Deep Reinforcement Learning | May 12, 2024 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 | 0 |
| On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning | Dec 13, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| On Double Descent in Reinforcement Learning with LSTD and Random Features | Oct 9, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| One for Many: Transfer Learning for Building HVAC Control | Aug 9, 2020 | Deep Reinforcement LearningTransfer Learning | —Unverified | 0 | 0 |