| Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models | Apr 15, 2025 | Humanoid ControlReinforcement Learning (RL) | CodeCode Available | 4 | 5 |
| ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters | May 4, 2022 | GPUImitation Learning | CodeCode Available | 3 | 5 |
| M2CURL: Sample-Efficient Multimodal Reinforcement Learning via Self-Supervised Representation Learning for Robotic Manipulation | Jan 30, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 | 5 |
| Explore and Control with Adversarial Surprise | Jul 12, 2021 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 | 5 |
| Mastering the Unsupervised Reinforcement Learning Benchmark from Pixels | Sep 24, 2022 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 | 5 |
| The Information Geometry of Unsupervised Reinforcement Learning | Oct 6, 2021 | Contrastive Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Skill-Based Reinforcement Learning with Intrinsic Reward Matching | Oct 14, 2022 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 | 5 |
| Behavior From the Void: Unsupervised Active Pre-Training | Mar 8, 2021 | Atari GamesReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Choreographer: Learning and Adapting Skills in Imagination | Nov 23, 2022 | Unsupervised Reinforcement Learning | CodeCode Available | 1 | 5 |
| CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery | Feb 1, 2022 | Contrastive LearningDiversity | CodeCode Available | 1 | 5 |
| Self-Supervised Exploration via Disagreement | Jun 10, 2019 | Active LearningEfficient Exploration | CodeCode Available | 1 | 5 |
| Reinforcement Learning with Prototypical Representations | Feb 22, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| A Mixture of Surprises for Unsupervised Reinforcement Learning | Oct 13, 2022 | reinforcement-learningReinforcement Learning | CodeCode Available | 1 | 5 |
| Curiosity-driven Exploration by Self-supervised Prediction | May 15, 2017 | PredictionUnsupervised Reinforcement Learning | CodeCode Available | 1 | 5 |
| METRA: Scalable Unsupervised RL with Metric-Aware Abstraction | Oct 13, 2023 | Reinforcement Learning (RL)Unsupervised Pre-training | CodeCode Available | 1 | 5 |
| Diversity is All You Need: Learning Skills without a Reward Function | Feb 16, 2018 | AllDiversity | CodeCode Available | 1 | 5 |
| URLB: Unsupervised Reinforcement Learning Benchmark | Oct 28, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning | Jul 20, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Emergent Real-World Robotic Skills via Unsupervised Off-Policy Reinforcement Learning | Apr 27, 2020 | Model Predictive Controlreinforcement-learning | CodeCode Available | 1 | 5 |
| Exploration by Random Network Distillation | Oct 30, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Unsupervised Reinforcement Learning in Multiple Environments | Dec 16, 2021 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 | 5 |
| ComSD: Balancing Behavioral Quality and Diversity in Unsupervised Skill Discovery | Sep 29, 2023 | Contrastive LearningDiversity | CodeCode Available | 0 | 5 |
| CRC-RL: A Novel Visual Feature Representation Architecture for Unsupervised Reinforcement Learning | Jan 31, 2023 | Decoderreinforcement-learning | CodeCode Available | 0 | 5 |
| Efficient Exploration via State Marginal Matching | Jun 12, 2019 | Efficient ExplorationReinforcement Learning | CodeCode Available | 0 | 5 |
| Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations | Aug 4, 2022 | Efficient ExplorationUnsupervised Reinforcement Learning | CodeCode Available | 0 | 5 |
| SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments | Dec 11, 2019 | Navigatereinforcement-learning | CodeCode Available | 0 | 5 |
| Surprise-Adaptive Intrinsic Motivation for Unsupervised Reinforcement Learning | May 27, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 | 5 |
| Unsupervised multi-latent space reinforcement learning framework for video summarization in ultrasound imaging | Sep 3, 2021 | reinforcement-learningReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Variational Intrinsic Control | Nov 22, 2016 | Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 | 5 |
| Exploration with Principles for Diverse AI Supervision | Oct 13, 2023 | Reinforcement Learning (RL)Unsupervised Reinforcement Learning | —Unverified | 0 | 0 |
| Machine Learning for Intelligent Authentication in 5G-and-Beyond Wireless Networks | Jun 30, 2019 | BIG-bench Machine LearningReinforcement Learning | —Unverified | 0 | 0 |
| EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model | Oct 2, 2022 | reinforcement-learningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Curiosity & Entropy Driven Unsupervised RL in Multiple Environments | Jan 8, 2024 | Unsupervised Reinforcement Learning | —Unverified | 0 | 0 |
| Palm up: Playing in the Latent Manifold for Unsupervised Pretraining | Oct 19, 2022 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| POLTER: Policy Trajectory Ensemble Regularization for Unsupervised Reinforcement Learning | May 23, 2022 | Open-Ended Question Answeringreinforcement-learning | —Unverified | 0 | 0 |
| Constrained Ensemble Exploration for Unsupervised Skill Discovery | May 25, 2024 | Reinforcement Learning (RL)Unsupervised Reinforcement Learning | —Unverified | 0 | 0 |
| Rewardless Open-Ended Learning (ROEL) | Sep 29, 2021 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| CLIC: Curriculum Learning and Imitation for object Control in non-rewarding environments | Jan 28, 2019 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions | Oct 24, 2024 | DiversityInductive Bias | —Unverified | 0 | 0 |
| AutoQD: Automatic Discovery of Diverse Behaviors with Quality-Diversity Optimization | Jun 5, 2025 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| A Framework for Provably Stable and Consistent Training of Deep Feedforward Networks | May 20, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| SMiRL: Surprise Minimizing RL in Entropic Environments | Sep 25, 2019 | Unsupervised Pre-trainingUnsupervised Reinforcement Learning | —Unverified | 0 | 0 |
| Variational Intrinsic Control Revisited | Oct 7, 2020 | reinforcement-learningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Augmenting Unsupervised Reinforcement Learning with Self-Reference | Nov 16, 2023 | Attributereinforcement-learning | —Unverified | 0 | 0 |
| Unsupervised Discovery of Continuous Skills on a Sphere | May 21, 2023 | MuJoCoUnsupervised Reinforcement Learning | —Unverified | 0 | 0 |
| Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement Learning | Oct 25, 2021 | Domain Adaptationreinforcement-learning | —Unverified | 0 | 0 |
| APS: Active Pretraining with Successor Features | Aug 31, 2021 | Unsupervised Reinforcement Learning | —Unverified | 0 | 0 |
| Wasserstein Unsupervised Reinforcement Learning | Oct 15, 2021 | Hierarchical Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery | Apr 29, 2022 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Task Adaptation from Skills: Information Geometry, Disentanglement, and New Objectives for Unsupervised Reinforcement Learning | Jun 12, 2025 | DisentanglementDiversity | —Unverified | 0 | 0 |