SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 176200 of 514 papers

TitleStatusHype
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning0
Entropic Risk-Sensitive Reinforcement Learning: A Meta Regret Framework with Function Approximation0
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning0
Feature Engineering for Predictive Modeling using Reinforcement Learning0
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation0
Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization0
ActiveGAMER: Active GAussian Mapping through Efficient Rendering0
Evolutionary Reinforcement Learning via Cooperative Coevolution0
Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces0
Explicit Planning for Efficient Exploration in Reinforcement Learning0
Explicit Recall for Efficient Exploration0
Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes0
Exploration by Distributional Reinforcement Learning0
Exploration by Learning Diverse Skills through Successor State Measures0
Exploration by Uncertainty in Reward Space0
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models0
Exploration in Model-based Reinforcement Learning with Randomized Reward0
Exploration of the search space of Gaussian graphical models for paired data0
Exploration via Epistemic Value Estimation0
Data-Efficient Exploration with Self Play for Atari0
Exploratory Diffusion Model for Unsupervised Reinforcement Learning0
Deep Active Ensemble Sampling For Image Classification0
Explore until Confident: Efficient Exploration for Embodied Question Answering0
A Straightforward Gradient-Based Approach for High-Tc Superconductor Design: Leveraging Domain Knowledge via Adaptive Constraints0
Computing low-thrust transfers in the asteroid belt, a comparison between astrodynamical manipulations and a machine learning approach0
Show:102550
← PrevPage 8 of 21Next →

No leaderboard results yet.