| Skill Decision Transformer | Jan 31, 2023 | D4RLDescriptive | CodeCode Available | 0 |
| Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies | Jan 30, 2023 | Data AugmentationFeature Engineering | CodeCode Available | 0 |
| Learning to View: Decision Transformers for Active Object Detection | Jan 23, 2023 | Active Object DetectionMotion Planning | —Unverified | 0 |
| Benchmarks and Algorithms for Offline Preference-Based Reward Learning | Jan 3, 2023 | Active LearningOffline RL | —Unverified | 0 |
| Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives | Jan 3, 2023 | Offline RLRecommendation Systems | —Unverified | 0 |
| Offline Policy Optimization in RL with Variance Regularizaton | Dec 29, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints | Dec 28, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Representation Learning in Deep RL via Discrete Information Bottleneck | Dec 28, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies | Dec 15, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Confidence-Conditioned Value Functions for Offline Reinforcement Learning | Dec 8, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation | Dec 5, 2022 | BenchmarkingBinary Classification | —Unverified | 0 |
| TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets | Dec 5, 2022 | D4RLMuJoCo | CodeCode Available | 0 |
| Launchpad: Learning to Schedule Using Offline and Online RL Methods | Dec 1, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Offline Policy Evaluation and Optimization under Confounding | Nov 29, 2022 | Offline RLOff-policy evaluation | —Unverified | 0 |
| Offline Reinforcement Learning with Closed-Form Policy Improvement Operators | Nov 29, 2022 | D4RLForm | —Unverified | 0 |
| Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning | Nov 29, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Is Conditional Generative Modeling all you need for Decision-Making? | Nov 28, 2022 | AllDecision Making | —Unverified | 0 |
| State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| Domain Generalization for Robust Model-Based Offline Reinforcement Learning | Nov 27, 2022 | Domain GeneralizationOffline RL | —Unverified | 0 |
| On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation | Nov 23, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning | Nov 21, 2022 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Offline Reinforcement Learning with Adaptive Behavior Regularization | Nov 15, 2022 | D4RLOffline RL | —Unverified | 0 |
| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RLMeta Reinforcement Learning | —Unverified | 0 |
| Leveraging Offline Data in Online Reinforcement Learning | Nov 9, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |