| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Jun 6, 2023 | D4RLMuJoCo | CodeCode Available | 0 |
| PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation | Jun 6, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| State Regularized Policy Optimization on Data with Dynamics Shift | Jun 6, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Survival Instinct in Offline Reinforcement Learning | Jun 5, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding | Jun 1, 2023 | ManagementOffline RL | —Unverified | 0 |
| Improving and Benchmarking Offline Reinforcement Learning Algorithms | Jun 1, 2023 | AttributeBenchmarking | CodeCode Available | 1 |
| Improving Offline RL by Blending Heuristics | Jun 1, 2023 | D4RLOffline RL | —Unverified | 0 |
| IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control | Jun 1, 2023 | D4RLModel-based Reinforcement Learning | —Unverified | 0 |
| Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning | Jun 1, 2023 | FairnessOffline RL | —Unverified | 0 |
| Efficient Diffusion Policies for Offline Reinforcement Learning | May 31, 2023 | D4RLOffline RL | CodeCode Available | 1 |