| Explaining RL Decisions with Trajectories | May 6, 2023 | Attributecontinuous-control | CodeCode Available | 0 |
| Simple Noisy Environment Augmentation for Reinforcement Learning | May 4, 2023 | Data AugmentationDiversity | CodeCode Available | 0 |
| Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL | Apr 30, 2023 | Decision MakingMuJoCo | CodeCode Available | 1 |
| Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning | Apr 29, 2023 | Meta Reinforcement LearningMuJoCo | —Unverified | 0 |
| Feudal Graph Reinforcement Learning | Apr 11, 2023 | Decision MakingGraph Clustering | CodeCode Available | 0 |
| Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations | Mar 29, 2023 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Mar 14, 2023 | Decision MakingMuJoCo | CodeCode Available | 0 |
| Sample-efficient Adversarial Imitation Learning | Mar 14, 2023 | Decision MakingImitation Learning | —Unverified | 0 |
| Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint | Mar 8, 2023 | MuJoCo | —Unverified | 0 |
| Controlled Diversity with Preference : Towards Learning a Diverse Set of Desired Skills | Mar 7, 2023 | DiversityMuJoCo | CodeCode Available | 0 |
| A Strategy-Oriented Bayesian Soft Actor-Critic Model | Mar 7, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control | Mar 4, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| Decision Transformer under Random Frame Dropping | Mar 3, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting | Mar 2, 2023 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Meta-Reinforcement Learning via Exploratory Task Clustering | Feb 15, 2023 | ClusteringMeta Reinforcement Learning | —Unverified | 0 |
| When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning | Feb 15, 2023 | Autonomous Drivingcontinuous-control | CodeCode Available | 1 |
| Order Matters: Agent-by-agent Policy Optimization | Feb 13, 2023 | MuJoCo | CodeCode Available | 1 |
| Unlabeled Imperfect Demonstrations in Adversarial Imitation Learning | Feb 13, 2023 | Imitation LearningMuJoCo | CodeCode Available | 1 |
| CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning | Feb 9, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |
| Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization | Feb 5, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Online Reinforcement Learning in Non-Stationary Context-Driven Environments | Feb 4, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners | Feb 3, 2023 | DiversityMuJoCo | CodeCode Available | 1 |
| Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO) | Feb 1, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Neural Episodic Control with State Abstraction | Jan 27, 2023 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |