| CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels | Feb 19, 2021 | MuJoCo | —Unverified | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |
| Robust Policy Gradient against Strong Data Corruption | Feb 11, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Variance Penalized On-Policy and Off-Policy Actor-Critic | Feb 3, 2021 | MuJoCo | CodeCode Available | 0 |
| GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning | Jan 24, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments | Jan 20, 2021 | MuJoCo | CodeCode Available | 1 |
| Randomized Ensembled Double Q-Learning: Learning Fast Without a Model | Jan 15, 2021 | MuJoCoQ-Learning | CodeCode Available | 1 |
| Cross-Modal Domain Adaptation for Reinforcement Learning | Jan 1, 2021 | Domain AdaptationMuJoCo | CodeCode Available | 1 |
| Multi-Agent Trust Region Learning | Jan 1, 2021 | Atari GamesMuJoCo | CodeCode Available | 1 |
| CAT-SAC: Soft Actor-Critic with Curiosity-Aware Entropy Temperature | Jan 1, 2021 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Adaptive N-step Bootstrapping with Off-policy Data | Jan 1, 2021 | Atari GamesMuJoCo | —Unverified | 0 |
| TEAC: Intergrating Trust Region and Max Entropy Actor Critic for Continuous Control | Jan 1, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Invariant Representations for Reinforcement Learning without Reconstruction | Jan 1, 2021 | Causal InferenceMuJoCo | —Unverified | 0 |
| Intrinsically Guided Exploration in Meta Reinforcement Learning | Jan 1, 2021 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| PGPS : Coupling Policy Gradient with Population-based Search | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Self-Supervised Continuous Control without Policy Gradient | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Practical Marginalized Importance Sampling with the Successor Representation | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 |
| Formal Language Constrained Markov Decision Processes | Jan 1, 2021 | MuJoCo | —Unverified | 0 |
| MQES: Max-Q Entropy Search for Efficient Exploration in Continuous Reinforcement Learning | Jan 1, 2021 | Efficient ExplorationMuJoCo | —Unverified | 0 |
| Hellinger Distance Constrained Regression | Jan 1, 2021 | MuJoCoregression | —Unverified | 0 |
| Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards | Dec 26, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 |
| OPAC: Opportunistic Actor-Critic | Dec 11, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Reset-Free Lifelong Learning with Skill-Space Planning | Dec 7, 2020 | Lifelong learningMuJoCo | CodeCode Available | 1 |