| Skill Decision Transformer | Jan 31, 2023 | D4RLDescriptive | CodeCode Available | 0 |
| Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies | Jan 30, 2023 | Data AugmentationFeature Engineering | CodeCode Available | 0 |
| Learning to View: Decision Transformers for Active Object Detection | Jan 23, 2023 | Active Object DetectionMotion Planning | —Unverified | 0 |
| Benchmarks and Algorithms for Offline Preference-Based Reward Learning | Jan 3, 2023 | Active LearningOffline RL | —Unverified | 0 |
| Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives | Jan 3, 2023 | Offline RLRecommendation Systems | —Unverified | 0 |
| Offline Policy Optimization in RL with Variance Regularizaton | Dec 29, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints | Dec 28, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Representation Learning in Deep RL via Discrete Information Bottleneck | Dec 28, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies | Dec 15, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Confidence-Conditioned Value Functions for Offline Reinforcement Learning | Dec 8, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation | Dec 5, 2022 | BenchmarkingBinary Classification | —Unverified | 0 |
| TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets | Dec 5, 2022 | D4RLMuJoCo | CodeCode Available | 0 |
| Launchpad: Learning to Schedule Using Offline and Online RL Methods | Dec 1, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Offline Policy Evaluation and Optimization under Confounding | Nov 29, 2022 | Offline RLOff-policy evaluation | —Unverified | 0 |
| Offline Reinforcement Learning with Closed-Form Policy Improvement Operators | Nov 29, 2022 | D4RLForm | —Unverified | 0 |
| Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning | Nov 29, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Is Conditional Generative Modeling all you need for Decision-Making? | Nov 28, 2022 | AllDecision Making | —Unverified | 0 |
| State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Domain Generalization for Robust Model-Based Offline Reinforcement Learning | Nov 27, 2022 | Domain GeneralizationOffline RL | —Unverified | 0 |
| On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation | Nov 23, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning | Nov 21, 2022 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Offline Reinforcement Learning with Adaptive Behavior Regularization | Nov 15, 2022 | D4RLOffline RL | —Unverified | 0 |
| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RLMeta Reinforcement Learning | —Unverified | 0 |
| Leveraging Offline Data in Online Reinforcement Learning | Nov 9, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data | Nov 8, 2022 | Offline RL | —Unverified | 0 |
| Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning | Nov 6, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Contrastive Value Learning: Implicit Models for Simple Offline RL | Nov 3, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Oracle Inequalities for Model Selection in Offline Reinforcement Learning | Nov 3, 2022 | Model SelectionOffline RL | —Unverified | 0 |
| Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints | Nov 2, 2022 | Atari GamesOffline RL | —Unverified | 0 |
| Behavior Prior Representation learning for Offline Reinforcement Learning | Nov 2, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Dual Generator Offline Reinforcement Learning | Nov 2, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian | Nov 1, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Implicit Offline Reinforcement Learning via Supervised Learning | Oct 21, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning | Oct 20, 2022 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation | Oct 19, 2022 | D4RLMuJoCo | —Unverified | 0 |
| Boosting Offline Reinforcement Learning via Data Rebalancing | Oct 17, 2022 | D4RLOffline RL | —Unverified | 0 |
| Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data | Oct 16, 2022 | Model SelectionOffline RL | —Unverified | 0 |
| Mutual Information Regularized Offline Reinforcement Learning | Oct 14, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Oct 13, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| State Advantage Weighting for Offline RL | Oct 9, 2022 | D4RLOffline RL | —Unverified | 0 |
| The Role of Coverage in Online Reinforcement Learning | Oct 9, 2022 | Efficient ExplorationOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning | Sep 30, 2022 | Data AugmentationImage Generation | CodeCode Available | 0 |
| Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes | Sep 18, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Can Offline Reinforcement Learning Help Natural Language Understanding? | Sep 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation | Sep 14, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Task-Agnostic Learning to Accomplish New Tasks | Sep 9, 2022 | Imitation LearningOffline RL | —Unverified | 0 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RLOffline RL | —Unverified | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue EvaluationOffline RL | —Unverified | 0 |