SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 401425 of 514 papers

TitleStatusHype
Receding Horizon CuriosityCode0
NeuralUCB: Contextual Bandits with Neural Network-Based Exploration0
Deep exploration by novelty-pursuit with maximum state entropy0
Regulatory Focus: Promotion and Prevention Inclinations in Policy Search0
Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood MatchingCode0
Learning to Seek: Autonomous Source Seeking with Deep Reinforcement Learning Onboard a Nano Drone MicrocontrollerCode0
Learning Index Selection with Structured Action Spaces0
Biased Estimates of Advantages over Path Ensembles0
n-Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank0
Improving a State-of-the-Art Heuristic for the Minimum Latency Problem with Data Mining0
Learning to Explore in Motion and Interaction Tasks0
Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards0
Directed Exploration for Reinforcement Learning0
Learning-Driven Exploration for Reinforcement LearningCode0
Efficient Exploration via State Marginal MatchingCode0
Learning to Score Behaviors for Guided Policy OptimizationCode0
Self-Supervised Exploration via DisagreementCode1
Worst-Case Regret Bounds for Exploration via Randomized Value Functions0
Clustered Reinforcement Learning0
Learning Efficient and Effective Exploration Policies with Counterfactual Meta Policy0
Estimating Risk and Uncertainty in Deep Reinforcement LearningCode0
Learning Exploration Policies for Model-Agnostic Meta-Reinforcement Learning0
Distributional Reinforcement Learning for Efficient Exploration0
Optimizing Routerless Network-on-Chip Designs: An Innovative Learning-Based Framework0
Explicit Recall for Efficient Exploration0
Show:102550
← PrevPage 17 of 21Next →

No leaderboard results yet.