| Off-policy Evaluation in Doubly Inhomogeneous Environments | Jun 14, 2023 | Offline RL, Off-policy evaluation | Code Available | 0 | 5 |
| Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning | Nov 29, 2021 | Offline RL, reinforcement-learning | Code Available | 0 | 5 |
| Offline Equilibrium Finding | Jul 12, 2022 | Offline RL | Code Available | 0 | 5 |
| Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees | Nov 14, 2023 | Offline RL | Code Available | 0 | 5 |
| Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning | Oct 16, 2023 | Chatbot, Offline RL | Code Available | 0 | 5 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RL, Offline RL | Code Available | 0 | 5 |
| Mutual Information Regularized Offline Reinforcement Learning | Oct 14, 2022 | D4RL, Offline RL | Code Available | 0 | 5 |
| Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Jan 24, 2025 | MuJoCo, Offline RL | Code Available | 0 | 5 |
| Multi-Game Decision Transformers | May 30, 2022 | Atari Games, Offline RL | Code Available | 0 | 5 |
| MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Feb 20, 2024 | Decision Making, Offline RL | Code Available | 0 | 5 |