SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 251260 of 514 papers

TitleStatusHype
The split Gibbs sampler revisited: improvements to its algorithmic structure and augmented target distributionCode0
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation0
A Langevin-like Sampler for Discrete DistributionsCode1
Scalable Exploration for Neural Online Learning to Rank with Perturbed Feedback0
On Preemption and Learning in Stochastic SchedulingCode0
Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems0
Learning Math Reasoning from Self-Sampled Correct and Partially-Correct SolutionsCode1
Learning to Solve Combinatorial Graph Partitioning Problems via Efficient ExplorationCode1
Personalized Algorithmic Recourse with Preference ElicitationCode0
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning0
Show:102550
← PrevPage 26 of 52Next →

No leaderboard results yet.