| Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies | May 22, 2025 | Offline RLQ-Learning | —Unverified | 0 |
| Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation | Oct 30, 2024 | Offline RLQ-Learning | —Unverified | 0 |
| Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets | Nov 19, 2023 | ManagementOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Imbalanced Datasets | Jul 6, 2023 | D4RLOffline RL | —Unverified | 0 |
| Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints | Nov 2, 2022 | Atari GamesOffline RL | —Unverified | 0 |
| Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling | Dec 16, 2022 | MuJoCoQ-Learning | —Unverified | 0 |
| Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization | Sep 30, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| Off-policy Multi-step Q-learning | Sep 25, 2019 | Q-Learning | —Unverified | 0 |
| On Assessing The Safety of Reinforcement Learning algorithms Using Formal Methods | Nov 8, 2021 | Autonomous VehiclesQ-Learning | —Unverified | 0 |
| On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process | Feb 25, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| On-board Deep Q-Network for UAV-assisted Online Power Transfer and Data Collection | Jun 4, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly Communicating MDPs | Sep 30, 2022 | Q-Learning | —Unverified | 0 |
| On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes | Aug 29, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| On Decentralizing Federated Reinforcement Learning in Multi-Robot Scenarios | Jul 19, 2022 | Federated LearningQ-Learning | —Unverified | 0 |
| On Designing Multi-UAV aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning | Dec 13, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| On Global Convergence Rates for Federated Policy Gradient under Heterogeneous Environment | May 29, 2025 | Federated LearningPolicy Gradient Methods | —Unverified | 0 |
| On Information Asymmetry in Competitive Multi-Agent Reinforcement Learning: Convergence and Optimality | Oct 21, 2020 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Online Adaptive Optimal Control Algorithm Based on Synchronous Integral Reinforcement Learning With Explorations | May 19, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Online Antenna Tuning in Heterogeneous Cellular Networks with Deep Reinforcement Learning | Mar 15, 2019 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| On-line Building Energy Optimization using Deep Reinforcement Learning | Jul 18, 2017 | Deep Reinforcement Learningenergy management | —Unverified | 0 |
| Online Frequency Scheduling by Learning Parallel Actions | Jun 7, 2024 | Graph Neural NetworkQ-Learning | —Unverified | 0 |
| Online inductive learning from answer sets for efficient reinforcement learning exploration | Jan 13, 2025 | Inductive LearningInductive logic programming | —Unverified | 0 |
| Online Learning for Offloading and Autoscaling in Energy Harvesting Mobile Edge Computing | Mar 17, 2017 | Edge-computingManagement | —Unverified | 0 |
| Online Robust Reinforcement Learning with Model Uncertainty | Sep 29, 2021 | modelQ-Learning | —Unverified | 0 |
| Online Statistical Inference for Nonlinear Stochastic Approximation with Markovian Data | Feb 15, 2023 | Q-Learningvalid | —Unverified | 0 |
| Asymptotic Analysis of Sample-averaged Q-learning | Oct 14, 2024 | OpenAI GymQ-Learning | —Unverified | 0 |
| Online Target Q-learning with Reverse Experience Replay: Efficiently finding the Optimal Policy for Linear MDPs | Oct 16, 2021 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 |
| Online Transfer Learning in Reinforcement Learning Domains | Jul 2, 2015 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Online waveform selection for cognitive radar | Oct 14, 2024 | Q-Learning | —Unverified | 0 |
| On optimal tracking portfolio in incomplete markets: The reinforcement learning approach | Nov 24, 2023 | Q-Learning | —Unverified | 0 |
| On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm | May 11, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| On the Convergence and Sample Complexity Analysis of Deep Q-Networks with ε-Greedy Exploration | Oct 24, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| On the Convergence of Approximate and Regularized Policy Iteration Schemes | Sep 20, 2019 | Q-LearningReinforcement Learning | —Unverified | 0 |
| On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs | Sep 7, 2022 | Open-Ended Question AnsweringQ-Learning | —Unverified | 0 |
| On the Convergence Rates of Federated Q-Learning across Heterogeneous Environments | Sep 5, 2024 | Q-Learning | —Unverified | 0 |
| On the Global Convergence of Fitted Q-Iteration with Two-layer Neural Network Parametrization | Nov 14, 2022 | Decision MakingQ-Learning | —Unverified | 0 |
| On the Reduction of Variance and Overestimation of Deep Q-Learning | Oct 14, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 |
| OPA-Pack: Object-Property-Aware Robotic Bin Packing | May 19, 2025 | ObjectQ-Learning | —Unverified | 0 |
| OpenSense: An Open-World Sensing Framework for Incremental Learning and Dynamic Sensor Scheduling on Embedded Edge Devices | Nov 29, 2023 | Incremental LearningQ-Learning | —Unverified | 0 |
| Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning | Jan 1, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Optimal and Fast Real-time Resources Slicing with Deep Dueling Neural Networks | Feb 26, 2019 | Combinatorial OptimizationQ-Learning | —Unverified | 0 |
| Optimal Beam Association for High Mobility mmWave Vehicular Networks: Lightweight Parallel Reinforcement Learning Approach | May 2, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Optimal Control of District Cooling Energy Plant with Reinforcement Learning and MPC | Oct 5, 2023 | Model Predictive ControlQ-Learning | —Unverified | 0 |
| Optimal coordination of resources: A solution from reinforcement learning | Dec 20, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Optimal Cycling of a Heterogenous Battery Bank via Reinforcement Learning | Sep 15, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 |
| Optimal Decision-Making in Mixed-Agent Partially Observable Stochastic Environments via Reinforcement Learning | Jan 4, 2019 | Decision MakingImage Segmentation | —Unverified | 0 |
| Optimal Demand Response Using Device Based Reinforcement Learning | Jan 8, 2014 | energy managementManagement | —Unverified | 0 |
| Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services | Mar 12, 2024 | Q-Learning | —Unverified | 0 |