SOTAVerified|Agents Browse Leaderboard About

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–260 of 514 papers

Title	Date	Tasks	Status	Hype
The split Gibbs sampler revisited: improvements to its algorithmic structure and augmented target distribution	Jun 28, 2022	Data AugmentationDeblurring	CodeCode Available	0
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation	Jun 22, 2022	Efficient ExplorationObject	—Unverified	0
A Langevin-like Sampler for Discrete Distributions	Jun 20, 2022	Efficient ExplorationText Generation	CodeCode Available	1
Scalable Exploration for Neural Online Learning to Rank with Perturbed Feedback	Jun 13, 2022	Computational EfficiencyEfficient Exploration	—Unverified	0
On Preemption and Learning in Stochastic Scheduling	May 31, 2022	Efficient ExplorationScheduling	CodeCode Available	0
Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems	May 31, 2022	Efficient Explorationreinforcement-learning	—Unverified	0
Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions	May 28, 2022	Arithmetic ReasoningEfficient Exploration	CodeCode Available	1
Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration	May 27, 2022	Efficient Explorationgraph partitioning	CodeCode Available	1
Personalized Algorithmic Recourse with Preference Elicitation	May 27, 2022	Efficient Exploration	CodeCode Available	0
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning	May 26, 2022	continuous-controlContinuous Control	—Unverified	0

Show:10 25 50

← PrevPage 26 of 52Next →

No leaderboard results yet.