| Accelerating exploration and representation learning with offline pre-training | Mar 31, 2023 | Decision MakingNetHack | —Unverified | 0 |
| MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Mar 30, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Probabilistic inverse optimal control for non-linear partially observable systems disentangles perceptual uncertainty and behavioral costs | Mar 29, 2023 | Active LearningDecision Making | CodeCode Available | 0 |
| Boosting Reinforcement Learning and Planning with Demonstrations: A Survey | Mar 23, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Reflexion: Language Agents with Verbal Reinforcement Learning | Mar 20, 2023 | Decision MakingHumanEval | CodeCode Available | 4 |
| Welfare Maximization Algorithm for Solving Budget-Constrained Multi-Component POMDPs | Mar 18, 2023 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP | Mar 16, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Latent-Conditioned Policy Gradient for Multi-Objective Deep Reinforcement Learning | Mar 15, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Mar 14, 2023 | Decision MakingMuJoCo | CodeCode Available | 0 |
| Sample-efficient Adversarial Imitation Learning | Mar 14, 2023 | Decision MakingImitation Learning | —Unverified | 0 |