| The Virtues of Pessimism in Inverse Reinforcement Learning | Feb 4, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Feb 4, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning | Feb 4, 2024 | Meta Reinforcement LearningOffline RL | CodeCode Available | 1 |
| Adaptive Q-Aid for Conditional Supervised Learning in Offline Reinforcement Learning | Feb 3, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update | Feb 1, 2024 | Imitation LearningOffline RL | CodeCode Available | 1 |
| Context-Former: Stitching via Latent Conditioned Sequence Modeling | Jan 29, 2024 | D4RLDecision Making | —Unverified | 0 |
| Multi-Object Navigation in real environments using hybrid policies | Jan 24, 2024 | Imitation LearningObject | —Unverified | 0 |
| Differentiable Tree Search Network | Jan 22, 2024 | Decision MakingInductive Bias | CodeCode Available | 5 |
| Solving Offline Reinforcement Learning with Decision Tree Regression | Jan 21, 2024 | D4RLFeature Importance | CodeCode Available | 0 |
| MoMA: Model-based Mirror Ascent for Offline Reinforcement Learning | Jan 21, 2024 | Decision MakingOffline RL | —Unverified | 0 |
| Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model | Jan 19, 2024 | Offline RLreinforcement-learning | CodeCode Available | 2 |
| Harnessing Density Ratios for Online Reinforcement Learning | Jan 18, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning | Jan 17, 2024 | Offline RLRobot Manipulation | CodeCode Available | 0 |
| Learning from Sparse Offline Datasets via Conservative Density Estimation | Jan 16, 2024 | D4RLDensity Estimation | CodeCode Available | 0 |
| Solving Continual Offline Reinforcement Learning with Decision Transformer | Jan 16, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Optimistic Model Rollouts for Pessimistic Offline Policy Optimization | Jan 11, 2024 | modelOffline RL | —Unverified | 0 |
| SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Jan 6, 2024 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning | Jan 6, 2024 | Offline RLRobot Manipulation | —Unverified | 0 |
| On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision MakingDiversity | —Unverified | 0 |
| Policy-regularized Offline Multi-objective Reinforcement Learning | Jan 4, 2024 | Multi-Objective Reinforcement LearningOffline RL | CodeCode Available | 0 |
| POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning | Jan 1, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning | Jan 1, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Online Symbolic Music Alignment with Offline Reinforcement Learning | Dec 31, 2023 | Dynamic Time WarpingOffline RL | CodeCode Available | 1 |
| PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning | Dec 26, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Dec 21, 2023 | D4RLOffline RL | CodeCode Available | 1 |