SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 176200 of 514 papers

TitleStatusHype
Few-shot_LLM_Synthetic_Data_with_Distribution_MatchingCode0
Count-Based Exploration with the Successor RepresentationCode0
Noisy Networks for ExplorationCode0
Fire Burns, Sword Cuts: Commonsense Inductive Bias for Exploration in Text-based GamesCode0
Personalized Algorithmic Recourse with Preference ElicitationCode0
Estimating Risk and Uncertainty in Deep Reinforcement LearningCode0
EXPODE: EXploiting POlicy Discrepancy for Efficient Exploration in Multi-agent Reinforcement LearningCode0
A diversity-enhanced genetic algorithm for efficient exploration of parameter spacesCode0
Feature Interaction Aware Automated Data Representation TransformationCode0
Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient ExplorationCode0
Online Limited Memory Neural-Linear Bandits with Likelihood MatchingCode0
Exploring through Random Curiosity with General Value FunctionsCode0
Federated Control with Hierarchical Multi-Agent Deep Reinforcement LearningCode0
CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement LearningCode0
LECO: Learnable Episodic Count for Task-Specific Intrinsic RewardCode0
Bayesian Curiosity for Efficient Exploration in Reinforcement LearningCode0
Principled Exploration via Optimistic Bootstrapping and Backward InductionCode0
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning0
Distilling Realizable Students from Unrealizable Teachers0
Discovering Context Specific Causal Relationships0
BooVI: Provably Efficient Bootstrapped Value Iteration0
DISCO-10M: A Large-Scale Music Dataset0
Directed Exploration in PAC Model-Free Reinforcement Learning0
Biased Estimates of Advantages over Path Ensembles0
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning0
Show:102550
← PrevPage 8 of 21Next →

No leaderboard results yet.