SOTAVerified

Efficient Exploration

Efficient exploration is one of the main obstacles to scaling up modern deep reinforcement learning algorithms. The main challenge is balancing the exploitation of current estimates against gathering information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
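The trade-off described above can be illustrated on a toy multi-armed bandit. The sketch below uses UCB1, a standard bandit rule chosen here only for illustration; it is not the method of the source paper. The function name `ucb1`, the arm means, and the step count are all hypothetical. Each arm's score is its current reward estimate (exploitation) plus a bonus that shrinks as the arm is pulled more often (exploration).

```python
import math
import random


def ucb1(true_means, steps=2000, seed=0):
    """Run UCB1 on a Bernoulli bandit; return pull counts and mean estimates.

    true_means are hypothetical success probabilities, hidden from the agent.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms    # number of pulls per arm
    values = [0.0] * n_arms  # running mean reward per arm

    for t in range(1, steps + 1):
        if t <= n_arms:
            # Pull every arm once so each has an initial estimate.
            arm = t - 1
        else:
            # Exploit the estimate, plus a bonus for rarely tried arms.
            arm = max(
                range(n_arms),
                key=lambda a: values[a]
                + math.sqrt(2.0 * math.log(t) / counts[a]),
            )
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the running mean.
        values[arm] += (reward - values[arm]) / counts[arm]

    return counts, values


if __name__ == "__main__":
    counts, values = ucb1([0.2, 0.5, 0.8])
    print("pulls per arm:", counts)
    print("estimated means:", [round(v, 2) for v in values])
```

Under these assumptions, most pulls should end up on the arm with the highest mean, while the other arms still receive enough pulls to rule them out. Deep RL exploration methods face the same tension in much larger state and action spaces.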

Papers

Showing 326-350 of 514 papers

Title | Status | Hype
Adversarially Guided Actor-Critic | Code | 1
Online Limited Memory Neural-Linear Bandits with Likelihood Matching | Code | 0
Sparse Reward Exploration via Novelty Search and Emitters | Code | 0
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors | | 0
Autonomous synthesis of metastable materials | | 0
Entropic Risk-Sensitive Reinforcement Learning: A Meta Regret Framework with Function Approximation | | 0
Intrinsically Guided Exploration in Meta Reinforcement Learning | | 0
Optimistic Exploration with Backward Bootstrapped Bonus for Deep Reinforcement Learning | | 0
Online Limited Memory Neural-Linear Bandits | | 0
MQES: Max-Q Entropy Search for Efficient Exploration in Continuous Reinforcement Learning | | 0
Robotic Grasping of Fully-Occluded Objects using RF Perception | | 0
Curiosity in exploring chemical space: Intrinsic rewards for deep molecular reinforcement learning | | 0
BeBold: Exploration Beyond the Boundary of Explored Regions | Code | 1
SAR Image Despeckling Based on Convolutional Denoising Autoencoder | | 0
Hybrid Genetic Search for the CVRP: Open-Source Implementation and SWAP* Neighborhood | Code | 1
Model-based Reinforcement Learning for Continuous Control with Posterior Sampling | Code | 0
Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization | | 0
A New Bandit Setting Balancing Information from State Evolution and Corrupted Context | Code | 0
Hierarchical reinforcement learning for efficient exploration and transfer | | 0
Amortized Variational Deep Q Network | Code | 0
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning | | 0
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling | | 0
Deep Learning based Uncertainty Decomposition for Real-time Control | | 0
Latent World Models For Intrinsically Motivated Exploration | Code | 1
Efficient, Decentralized, and Collaborative Multi-Robot Exploration using Optimal Transport Theory | | 0
Page 14 of 21

No leaderboard results yet.