| OCMDP: Observation-Constrained Markov Decision Process | Nov 11, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| OFDM-Based Digital Semantic Communication with Importance Awareness | Jan 4, 2024 | Deep Reinforcement LearningSemantic Communication | —Unverified | 0 | 0 |
| Offline Deep Reinforcement Learning for Dynamic Pricing of Consumer Credit | Mar 6, 2022 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Offline Imitation Learning Through Graph Search and Retrieval | Jul 22, 2024 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 | 0 |
| Offline reinforcement learning for job-shop scheduling problems | Oct 21, 2024 | Combinatorial OptimizationDeep Learning | —Unverified | 0 | 0 |
| Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning | Feb 14, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks | Dec 11, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Off-Policy Deep Reinforcement Learning by Bootstrapping the Covariate Shift | Jan 27, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Off-Policy Evaluation via Off-Policy Classification | Jun 4, 2019 | ClassificationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift | Nov 16, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |