| COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Mar 16, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | Mar 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Mar 3, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Reliable validation of Reinforcement Learning Benchmarks | Mar 2, 2022 | BenchmarkingData Compression | —Unverified | 0 |
| A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Mar 2, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | Feb 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Settling the Communication Complexity for Distributed Offline Reinforcement Learning | Feb 10, 2022 | Multi-Armed BanditsOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Realizability and Single-policy Concentrability | Feb 9, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Transferred Q-learning | Feb 9, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| How to Leverage Unlabeled Data in Offline Reinforcement Learning | Feb 3, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |