SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 76100 of 514 papers

TitleStatusHype
Instance Temperature Knowledge DistillationCode0
Better Exploration with Optimistic Actor CriticCode0
Scalable Exploration via Ensemble++Code0
Large-Batch, Iteration-Efficient Neural Bayesian Design OptimizationCode0
Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy Behavior Representation for Deep Reinforcement LearningCode0
Angrier Birds: Bayesian reinforcement learningCode0
Hierarchical Spatial Proximity Reasoning for Vision-and-Language NavigationCode0
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgentCode0
Heterogeneous Multi-player Multi-armed Bandits: Closing the Gap and GeneralizationCode0
An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement LearningCode0
Hierarchically Organized Latent Modules for Exploratory Search in Morphogenetic SystemsCode0
Goal-Reaching Policy Learning from Non-Expert Observations via Effective Subgoal GuidanceCode0
Data-Efficient Exploration, Optimization, and Modeling of Diverse Designs through Surrogate-Assisted IlluminationCode0
Bayesian Curiosity for Efficient Exploration in Reinforcement LearningCode0
Go Beyond Imagination: Maximizing Episodic Reachability with World ModelsCode0
Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and ExplorationsCode0
CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement LearningCode0
Amortized Variational Deep Q NetworkCode0
Generalization and Exploration via Randomized Value FunctionsCode0
Batch Bayesian Optimization via Local PenalizationCode0
Curiosity Driven Exploration of Learned Disentangled Goal SpacesCode0
GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal BabblingCode0
Personalized Algorithmic Recourse with Preference ElicitationCode0
Curiosity as a Self-Supervised Method to Improve Exploration in De novo Drug DesignCode0
Balancing Value Underestimation and Overestimation with Realistic Actor-CriticCode0
Show:102550
← PrevPage 4 of 21Next →

No leaderboard results yet.