SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 301325 of 514 papers

TitleStatusHype
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning0
Exploratory State Representation LearningCode0
Exploring More When It Needs in Deep Reinforcement Learning0
Differentially Evolving Memory Ensembles: Pareto Optimization based on Computational Intelligence for Embedded Memories on a System Level0
Multi-Agent Embodied Visual Semantic Navigation with Scene Prior Knowledge0
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain0
Bootstrapped Meta-LearningCode0
A Gradient Sampling Algorithm for Stratified Maps with Applications to Topological Data AnalysisCode0
Strategically Efficient Exploration in Competitive Multi-agent Reinforcement LearningCode1
Learn2Hop: Learned Optimization on Rough Landscapes0
Multimodal Reward Shaping for Efficient Exploration in Reinforcement Learning0
MADE: Exploration via Maximizing Deviation from Explored RegionsCode1
Data-Efficient Exploration with Self Play for Atari0
Impact of detecting clinical trial elements in exploration of COVID-19 literature0
Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning0
Principled Exploration via Optimistic Bootstrapping and Backward InductionCode0
Deep Bandits Show-Off: Simple and Efficient Exploration with Deep NetworksCode1
Paradiseo: From a Modular Framework for Evolutionary Computation to the Automated Design of Metaheuristics ---22 Years of Paradiseo---Code1
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores0
Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy Behavior Representation for Deep Reinforcement LearningCode0
Nonlinear model reduction for slow-fast stochastic systems near unknown invariant manifoldsCode0
Efficient Optimal Selection for Composited Advertising Creatives with Tree StructureCode0
State Entropy Maximization with Random Encoders for Efficient ExplorationCode1
Learning Memory-Dependent Continuous Control from Demonstrations0
Meta-Thompson Sampling0
Show:102550
← PrevPage 13 of 21Next →

No leaderboard results yet.