| Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning | Feb 4, 2024 | Meta Reinforcement Learning, Offline RL | Code Available | 1 |
| The Virtues of Pessimism in Inverse Reinforcement Learning | Feb 4, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Feb 4, 2024 | D4RL, Data Augmentation | Unverified | 0 |
| Adaptive Q-Aid for Conditional Supervised Learning in Offline Reinforcement Learning | Feb 3, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update | Feb 1, 2024 | Imitation Learning, Offline RL | Code Available | 1 |
| Context-Former: Stitching via Latent Conditioned Sequence Modeling | Jan 29, 2024 | D4RL, Decision Making | Unverified | 0 |
| Multi-Object Navigation in real environments using hybrid policies | Jan 24, 2024 | Imitation Learning, Object | Unverified | 0 |
| Differentiable Tree Search Network | Jan 22, 2024 | Decision Making, Inductive Bias | Code Available | 5 |
| Solving Offline Reinforcement Learning with Decision Tree Regression | Jan 21, 2024 | D4RL, Feature Importance | Code Available | 0 |
| MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning | Jan 21, 2024 | Decision Making, Offline RL | Unverified | 0 |
| Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model | Jan 19, 2024 | Offline RL, reinforcement-learning | Code Available | 2 |
| Harnessing Density Ratios for Online Reinforcement Learning | Jan 18, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning | Jan 17, 2024 | Offline RL, Robot Manipulation | Code Available | 0 |
| Solving Continual Offline Reinforcement Learning with Decision Transformer | Jan 16, 2024 | Offline RL, reinforcement-learning | Unverified | 0 |
| Learning from Sparse Offline Datasets via Conservative Density Estimation | Jan 16, 2024 | D4RL, Density Estimation | Code Available | 0 |
| Optimistic Model Rollouts for Pessimistic Offline Policy Optimization | Jan 11, 2024 | model, Offline RL | Unverified | 0 |
| SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Jan 6, 2024 | Deep Reinforcement Learning, Diversity | Code Available | 0 |
| On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision Making, Diversity | Unverified | 0 |
| MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning | Jan 6, 2024 | Offline RL, Robot Manipulation | Unverified | 0 |
| Policy-regularized Offline Multi-objective Reinforcement Learning | Jan 4, 2024 | Multi-Objective Reinforcement Learning, Offline RL | Code Available | 0 |
| POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning | Jan 1, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 0 |
| Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning | Jan 1, 2024 | continuous-control, Continuous Control | Unverified | 0 |
| Online Symbolic Music Alignment with Offline Reinforcement Learning | Dec 31, 2023 | Dynamic Time Warping, Offline RL | Code Available | 1 |
| PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning | Dec 26, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Dec 21, 2023 | D4RL, Offline RL | Code Available | 1 |
| Neural Network Approximation for Pessimistic Offline Reinforcement Learning | Dec 19, 2023 | Deep Reinforcement Learning, Offline RL | Unverified | 0 |
| CUDC: A Curiosity-Driven Unsupervised Data Collection Method with Adaptive Temporal Distances for Offline Reinforcement Learning | Dec 19, 2023 | Navigate, Offline RL | Unverified | 0 |
| Advancing RAN Slicing with Offline Reinforcement Learning | Dec 16, 2023 | Management, Offline RL | Unverified | 0 |
| Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach | Dec 12, 2023 | Knowledge Distillation, Offline RL | Code Available | 1 |
| A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | Dec 12, 2023 | MuJoCo, Offline RL | Code Available | 0 |
| The Generalization Gap in Offline Reinforcement Learning | Dec 10, 2023 | Offline RL, reinforcement-learning | Code Available | 1 |
| Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization | Dec 7, 2023 | Model-based Reinforcement Learning, Offline RL | Unverified | 0 |
| MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator | Dec 7, 2023 | Offline RL, reinforcement-learning | Code Available | 0 |
| Evaluation of Active Feature Acquisition Methods for Static Feature Settings | Dec 6, 2023 | Offline RL, reinforcement-learning | Unverified | 0 |
| Diffused Task-Agnostic Milestone Planner | Dec 6, 2023 | Decision Making, Offline RL | Unverified | 0 |
| H-GAP: Humanoid Control with a Generalist Planner | Dec 5, 2023 | Humanoid Control, Model Predictive Control | Unverified | 0 |
| SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation | Nov 30, 2023 | Offline RL, Off-policy evaluation | Code Available | 1 |
| Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective | Nov 29, 2023 | Offline RL, reinforcement-learning | Unverified | 0 |
| Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning | Nov 29, 2023 | Astronomy, Offline RL | Unverified | 0 |
| A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning | Nov 27, 2023 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets | Nov 19, 2023 | Management, Offline RL | Unverified | 0 |
| Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees | Nov 14, 2023 | Offline RL | Code Available | 0 |
| Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Nov 1, 2023 | Decision Making, Hierarchical Reinforcement Learning | Unverified | 0 |
| Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | Oct 31, 2023 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCo, Offline RL | Unverified | 0 |
| Offline RL with Observation Histories: Analyzing and Improving Sample Complexity | Oct 31, 2023 | Autonomous Navigation, Offline RL | Unverified | 0 |
| Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning | Oct 31, 2023 | Few-Shot Learning, Offline RL | Code Available | 1 |
| Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning | Oct 30, 2023 | Decision Making, Offline RL | Code Available | 1 |
| Robust Offline Reinforcement learning with Heavy-Tailed Rewards | Oct 28, 2023 | Offline RL, Off-policy evaluation | Code Available | 0 |
| Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data Coverage | Oct 27, 2023 | Offline RL, Reinforcement Learning (RL) | Code Available | 0 |