| Particle Based Stochastic Policy Optimization | Sep 29, 2021 | Deep Reinforcement Learning, MuJoCo Games | —Unverified | 0 |
| Pareto Policy Pool for Model-based Offline Reinforcement Learning | Sep 29, 2021 | D4RL, Offline RL | —Unverified | 0 |
| Uncertainty Regularized Policy Learning for Offline Reinforcement Learning | Sep 29, 2021 | D4RL, Offline RL | —Unverified | 0 |
| Variational oracle guiding for reinforcement learning | Sep 29, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | Sep 29, 2021 | Offline RL, Q-Learning | —Unverified | 0 |
| Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning | Sep 29, 2021 | Multi-Task Learning, Offline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Resource Constrained Online Deployment | Sep 29, 2021 | D4RL, Offline RL | —Unverified | 0 |
| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | Continuous Control | —Unverified | 0 |
| Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | Sep 17, 2021 | Decision Making, Offline RL | —Unverified | 0 |
| Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | Sep 16, 2021 | Offline RL, Reinforcement Learning | —Unverified | 0 |