| Skill Decision Transformer | Jan 31, 2023 | D4RLDescriptive | CodeCode Available | 0 |
| Direct Preference-based Policy Optimization without Reward Modeling | Jan 30, 2023 | Contrastive LearningOffline RL | CodeCode Available | 1 |
| Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies | Jan 30, 2023 | Data AugmentationFeature Engineering | CodeCode Available | 0 |
| Guiding Online Reinforcement Learning with Action-Free Offline Pretraining | Jan 30, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Learning to View: Decision Transformers for Active Object Detection | Jan 23, 2023 | Active Object DetectionMotion Planning | —Unverified | 0 |
| Extreme Q-Learning: MaxEnt RL without Entropy | Jan 5, 2023 | D4RLDeep Reinforcement Learning | CodeCode Available | 1 |
| Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives | Jan 3, 2023 | Offline RLRecommendation Systems | —Unverified | 0 |
| Benchmarks and Algorithms for Offline Preference-Based Reward Learning | Jan 3, 2023 | Active LearningOffline RL | —Unverified | 0 |
| Offline Policy Optimization in RL with Variance Regularizaton | Dec 29, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Representation Learning in Deep RL via Discrete Information Bottleneck | Dec 28, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints | Dec 28, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning for Visual Navigation | Dec 16, 2022 | NavigateOffline RL | CodeCode Available | 1 |
| Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies | Dec 15, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Confidence-Conditioned Value Functions for Offline Reinforcement Learning | Dec 8, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation | Dec 5, 2022 | BenchmarkingBinary Classification | —Unverified | 0 |
| TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets | Dec 5, 2022 | D4RLMuJoCo | CodeCode Available | 0 |
| Launchpad: Learning to Schedule Using Offline and Online RL Methods | Dec 1, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning | Nov 30, 2022 | AllDecision Making | CodeCode Available | 1 |
| Efficient Reinforcement Learning Through Trajectory Generation | Nov 30, 2022 | LEMMAOffline RL | CodeCode Available | 1 |
| Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning | Nov 29, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Offline Policy Evaluation and Optimization under Confounding | Nov 29, 2022 | Offline RLOff-policy evaluation | —Unverified | 0 |
| Offline Reinforcement Learning with Closed-Form Policy Improvement Operators | Nov 29, 2022 | D4RLForm | —Unverified | 0 |
| Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Is Conditional Generative Modeling all you need for Decision-Making? | Nov 28, 2022 | AllDecision Making | —Unverified | 0 |
| State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Domain Generalization for Robust Model-Based Offline Reinforcement Learning | Nov 27, 2022 | Domain GeneralizationOffline RL | —Unverified | 0 |
| Masked Autoencoding for Scalable and Generalizable Decision Making | Nov 23, 2022 | Decision MakingOffline RL | CodeCode Available | 1 |
| On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation | Nov 23, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning | Nov 21, 2022 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows | Nov 20, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size | Nov 20, 2022 | Offline RL | CodeCode Available | 1 |
| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RLMeta Reinforcement Learning | —Unverified | 0 |
| Offline Reinforcement Learning with Adaptive Behavior Regularization | Nov 15, 2022 | D4RLOffline RL | —Unverified | 0 |
| Leveraging Offline Data in Online Reinforcement Learning | Nov 9, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data | Nov 8, 2022 | Offline RL | —Unverified | 0 |
| Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning | Nov 6, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Contrastive Value Learning: Implicit Models for Simple Offline RL | Nov 3, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Oracle Inequalities for Model Selection in Offline Reinforcement Learning | Nov 3, 2022 | Model SelectionOffline RL | —Unverified | 0 |
| Dual Generator Offline Reinforcement Learning | Nov 2, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints | Nov 2, 2022 | Atari GamesOffline RL | —Unverified | 0 |
| Behavior Prior Representation learning for Offline Reinforcement Learning | Nov 2, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian | Nov 1, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Dungeons and Data: A Large-Scale NetHack Dataset | Nov 1, 2022 | Decision MakingNetHack | CodeCode Available | 2 |
| Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information | Oct 31, 2022 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Leveraging Demonstrations with Latent Space Priors | Oct 26, 2022 | Offline RL | CodeCode Available | 1 |
| Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning | Oct 25, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Implicit Offline Reinforcement Learning via Supervised Learning | Oct 21, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning | Oct 20, 2022 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| MoCoDA: Model-based Counterfactual Data Augmentation | Oct 20, 2022 | counterfactualData Augmentation | CodeCode Available | 1 |
| Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation | Oct 19, 2022 | D4RLMuJoCo | —Unverified | 0 |