SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 426450 of 514 papers

TitleStatusHype
Beyond Games: Bringing Exploration to Robots in Real-world0
The MineRL 2019 Competition on Sample Efficient Reinforcement Learning using Human PriorsCode0
Efficient Off-Policy Meta-Reinforcement Learning via Probabilistic Context VariablesCode0
Concurrent Meta Reinforcement LearningCode0
Learning Exploration Policies for NavigationCode1
Bayesian Reinforcement Learning via Deep, Sparse SamplingCode0
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching0
Information-Directed Exploration for Deep Reinforcement LearningCode0
Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes0
Playing Text-Adventure Games with Graph-Based Deep Reinforcement LearningCode0
Context-Dependent Upper-Confidence Bounds for Directed Exploration0
Incentivizing Exploration with Selective Data Disclosure0
DEEPGONET: Multi-label Prediction of GO Annotation for Protein from Sequence Using Cascaded Convolutional and Recurrent Network0
Model-Based Active ExplorationCode1
Multi-Agent Fully Decentralized Value Function Learning with Linear Convergence Rates0
CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement LearningCode0
NSGA-Net: Neural Architecture Search using Multi-Objective Genetic AlgorithmCode0
Thompson Sampling Algorithms for Cascading Bandits0
NSGA-Net: A Multi-Objective Genetic Algorithm for Neural Architecture Search0
Exploration by Uncertainty in Reward Space0
CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement LearningCode0
Directed Exploration in PAC Model-Free Reinforcement Learning0
Discovering Context Specific Causal Relationships0
Optimization by Pairwise Linkage Detection, Incremental Linkage Set, and Restricted / Back Mixing: DSMGA-II0
Count-Based Exploration with the Successor RepresentationCode0
Show:102550
← PrevPage 18 of 21Next →

No leaderboard results yet.