| Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments | Aug 23, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity | Aug 11, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning at Multiple Frequencies | Jul 26, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion | Jul 16, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| GriddlyJS: A Web IDE for Reinforcement Learning | Jul 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Offline Equilibrium Finding | Jul 12, 2022 | Offline RL | CodeCode Available | 0 |
| Offline RL Policies Should be Trained to be Adaptive | Jul 5, 2022 | Offline RL | —Unverified | 0 |
| An Empirical Study of Implicit Regularization in Deep Offline RL | Jul 5, 2022 | Offline RL | —Unverified | 0 |
| Prompting Decision Transformer for Few-Shot Policy Generalization | Jun 27, 2022 | Few-Shot LearningInductive Bias | —Unverified | 0 |
| A Survey on Model-based Reinforcement Learning | Jun 19, 2022 | Decision Makingmodel | —Unverified | 0 |
| Bootstrapped Transformer for Offline Reinforcement Learning | Jun 17, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination | Jun 16, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Contrastive Learning as Goal-Conditioned Reinforcement Learning | Jun 15, 2022 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Jun 14, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Provable Benefit of Multitask Representation Learning in Reinforcement Learning | Jun 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward | Jun 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Federated Offline Reinforcement Learning | Jun 11, 2022 | Offline RLPrivacy Preserving | —Unverified | 0 |
| Large-Scale Retrieval for Reinforcement Learning | Jun 10, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| On the Role of Discount Factor in Offline Reinforcement Learning | Jun 7, 2022 | D4RLOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Causal Structured World Models | Jun 3, 2022 | Model-based Reinforcement LearningOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Differential Privacy | Jun 2, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Model Generation with Provable Coverability for Offline Reinforcement Learning | Jun 1, 2022 | Offline RLOut-of-Distribution Generalization | —Unverified | 0 |
| Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL | Jun 1, 2022 | D4RLOffline RL | —Unverified | 0 |
| Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game | May 31, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments | May 31, 2022 | Offline RLPlaying the Game of 2048 | —Unverified | 0 |
| Multi-Game Decision Transformers | May 30, 2022 | Atari GamesOffline RL | CodeCode Available | 0 |
| Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters | May 27, 2022 | D4RLOffline RL | —Unverified | 0 |
| Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes | May 26, 2022 | Causal InferenceOffline RL | —Unverified | 0 |
| User-Interactive Offline Reinforcement Learning | May 21, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation | May 6, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Pessimism meets VCG: Learning Dynamic Mechanism Design via Offline Reinforcement Learning | May 5, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers | Apr 28, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Learning Value Functions from Undirected State-only Experience | Apr 26, 2022 | Future predictionImitation Learning | —Unverified | 0 |
| When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning? | Apr 12, 2022 | Atari GamesDiagnostic | —Unverified | 0 |
| Settling the Sample Complexity of Model-Based Offline Reinforcement Learning | Apr 11, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies | Mar 25, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps | Mar 25, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Bellman Residual Orthogonalization for Offline Reinforcement Learning | Mar 24, 2022 | Offline RLOff-policy evaluation | —Unverified | 0 |
| Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning | Mar 21, 2022 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Semi-Markov Offline Reinforcement Learning for Healthcare | Mar 17, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Mar 16, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement Learning | Mar 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Mar 3, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Reliable validation of Reinforcement Learning Benchmarks | Mar 2, 2022 | BenchmarkingData Compression | —Unverified | 0 |
| A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Mar 2, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | Feb 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Settling the Communication Complexity for Distributed Offline Reinforcement Learning | Feb 10, 2022 | Multi-Armed BanditsOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Realizability and Single-policy Concentrability | Feb 9, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Transferred Q-learning | Feb 9, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| How to Leverage Unlabeled Data in Offline Reinforcement Learning | Feb 3, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |