SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 176200 of 514 papers

TitleStatusHype
Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need0
Curiosity as a Self-Supervised Method to Improve Exploration in De novo Drug DesignCode0
Learning Spatial and Temporal Hierarchies: Hierarchical Active Inference for navigation in Multi-Room Maze Environments0
Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects0
Go Beyond Imagination: Maximizing Episodic Reachability with World ModelsCode0
Reinforcement learning informed evolutionary search for autonomous systems testing0
Bag of Policies for Distributional Deep Exploration0
Towards A Unified Agent with Foundation Models0
LISSNAS: Locality-based Iterative Search Space Shrinkage for Neural Architecture Search0
Approximate information for efficient exploration-exploitation strategies0
Improving Protein Optimization with Smoothed Fitness LandscapesCode1
Maximum State Entropy Exploration using Predecessor and Successor Representations0
DISCO-10M: A Large-Scale Music Dataset0
Inferring Hierarchical Structure in Multi-Room Maze Environments0
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP0
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning0
Tuning Legged Locomotion Controllers via Safe Bayesian OptimizationCode1
PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm0
Magnitude Attention-based Dynamic Pruning0
Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial OptimizationCode0
Large-Batch, Iteration-Efficient Neural Bayesian Design OptimizationCode0
Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search0
A Survey of Label-Efficient Deep Learning for 3D Point CloudsCode1
EXPODE: EXploiting POlicy Discrepancy for Efficient Exploration in Multi-agent Reinforcement LearningCode0
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte CarloCode1
Show:102550
← PrevPage 8 of 21Next →

No leaderboard results yet.