| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 |
| Supported Policy Optimization for Offline Reinforcement Learning | Feb 13, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Flowformer: Linearizing Transformers with Conservation Flows | Feb 13, 2022 | D4RLOffline RL | CodeCode Available | 2 |
| Settling the Communication Complexity for Distributed Offline Reinforcement Learning | Feb 10, 2022 | Multi-Armed BanditsOffline RL | —Unverified | 0 |
| Transferred Q-learning | Feb 9, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Offline Reinforcement Learning with Realizability and Single-policy Concentrability | Feb 9, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL | Feb 9, 2022 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Adversarially Trained Actor Critic for Offline Reinforcement Learning | Feb 5, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| How to Leverage Unlabeled Data in Offline Reinforcement Learning | Feb 3, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning | Jan 31, 2022 | DiversityOffline RL | CodeCode Available | 1 |