SOTAVerified

Efficient Exploration

Efficient exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge is balancing the exploitation of current estimates against gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
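
The trade-off described above can be illustrated with a minimal sketch, assuming a toy multi-armed bandit with Gaussian rewards and an epsilon-greedy rule. This is a standard textbook baseline, not the method of the source paper. The function name `epsilon_greedy_bandit` and all of its parameters are hypothetical and chosen only for illustration.

```python
import random


def epsilon_greedy_bandit(true_means, epsilon=0.1, steps=2000, seed=0):
    """Run epsilon-greedy on a toy Gaussian bandit (illustrative assumption).

    Returns (estimates, counts): the running mean reward estimate per arm
    and the number of times each arm was pulled.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    estimates = [0.0] * n_arms  # current value estimate for each arm
    counts = [0] * n_arms       # how often each arm has been tried

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: gather information about a uniformly random arm.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the best current estimate
            # (ties resolve to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = rng.gauss(true_means[arm], 1.0)  # unit-variance noise
        counts[arm] += 1
        # Incremental update of the running mean for the chosen arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


if __name__ == "__main__":
    est, cnt = epsilon_greedy_bandit([0.1, 0.5, 0.9], epsilon=0.1)
    print("estimates:", [round(e, 2) for e in est])
    print("pull counts:", cnt)
```

With `epsilon=0` the agent never explores and may lock onto whichever arm looked best early on. With `epsilon=1` it never uses what it has learned. Values in between trade one off against the other.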

Papers

Showing 276–300 of 514 papers

| Title | Status | Hype |
| --- | --- | --- |
| LECO: Learnable Episodic Count for Task-Specific Intrinsic Reward | Code | 0 |
| The Role of Coverage in Online Reinforcement Learning | | 0 |
| Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning | | 0 |
| Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning | | 0 |
| An Empirical Evaluation of Posterior Sampling for Constrained Reinforcement Learning | Code | 0 |
| Impact Makes a Sound and Sound Makes an Impact: Sound Guides Representations and Explorations | Code | 0 |
| Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks | | 0 |
| SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks | | 0 |
| The split Gibbs sampler revisited: improvements to its algorithmic structure and augmented target distribution | Code | 0 |
| GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning | | 0 |
| Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation | | 0 |
| Scalable Exploration for Neural Online Learning to Rank with Perturbed Feedback | | 0 |
| Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems | | 0 |
| On Preemption and Learning in Stochastic Scheduling | Code | 0 |
| Personalized Algorithmic Recourse with Preference Elicitation | Code | 0 |
| SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning | | 0 |
| Feature and Instance Joint Selection: A Reinforcement Learning Perspective | | 0 |
| Fire Burns, Sword Cuts: Commonsense Inductive Bias for Exploration in Text-based Games | Code | 0 |
| On Machine Learning-Driven Surrogates for Sound Transmission Loss Simulations | Code | 0 |
| A Variational Approach to Bayesian Phylogenetic Inference | Code | 0 |
| Efficient Exploration via First-Person Behavior Cloning Assisted Rapidly-Exploring Random Trees | | 0 |
| TANDEM: Learning Joint Exploration and Decision Making with Tactile Sensors | | 0 |
| Collaborative Training of Heterogeneous Reinforcement Learning Agents in Environments with Sparse Rewards: What and When to Share? | Code | 0 |
| Learning Causal Overhypotheses through Exploration in Children and Computational Models | | 0 |
| A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search | | 0 |

No leaderboard results yet.