SOTAVerified

Efficient Exploration

Efficient exploration is one of the main obstacles to scaling up modern deep reinforcement learning algorithms. The central challenge is balancing exploitation of current value estimates against gathering information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
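
As a rough illustration of the exploitation versus information-gathering trade-off described above, here is a minimal sketch of UCB1 on a multi-armed Bernoulli bandit. UCB1 is a classic bandit rule chosen only for illustration; it is not the method of the cited source, and the function name `ucb1` and its parameters are hypothetical.

```python
import math
import random


def ucb1(true_means, steps, seed=0):
    """Run UCB1 on a Bernoulli bandit and return the pull count per arm.

    true_means: assumed (hypothetical) success probability of each arm.
    steps: total number of pulls; should be at least the number of arms.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms   # how often each arm was pulled
    sums = [0.0] * n_arms   # total reward collected per arm

    for t in range(1, steps + 1):
        if t <= n_arms:
            # Pull every arm once so each empirical mean is defined.
            arm = t - 1
        else:
            # Score = empirical mean (exploitation term)
            #       + confidence bonus that shrinks as the arm is pulled more
            #         (exploration term).
            scores = [
                sums[a] / counts[a] + math.sqrt(2.0 * math.log(t) / counts[a])
                for a in range(n_arms)
            ]
            arm = max(range(n_arms), key=scores.__getitem__)

        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward

    return counts


if __name__ == "__main__":
    # Two arms with hypothetical success rates; prints the pull counts.
    print(ucb1([0.2, 0.8], steps=1000))
```

The bonus term keeps rarely tried arms attractive until enough evidence accumulates, so pulls concentrate on the better arm over time without ever fully abandoning the others.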

Papers

Showing 451–475 of 514 papers

| Title | Status | Hype |
| --- | --- | --- |
| Exploration by Uncertainty in Reward Space | | 0 |
| Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain | | 0 |
| Exploration in Model-based Reinforcement Learning with Randomized Reward | | 0 |
| Exploration of the search space of Gaussian graphical models for paired data | | 0 |
| Exploration via Epistemic Value Estimation | | 0 |
| Exploratory Diffusion Model for Unsupervised Reinforcement Learning | | 0 |
| Explore until Confident: Efficient Exploration for Embodied Question Answering | | 0 |
| Exploring More When It Needs in Deep Reinforcement Learning | | 0 |
| Cognitive Planning for Object Goal Navigation using Generative AI Models | | 0 |
| Extended Formulations for Online Linear Bandit Optimization | | 0 |
| FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs | | 0 |
| Fast exploration and learning of latent graphs with aliased observations | | 0 |
| Feature and Instance Joint Selection: A Reinforcement Learning Perspective | | 0 |
| Feature Engineering for Predictive Modeling using Reinforcement Learning | | 0 |
| Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | | 0 |
| Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts | | 0 |
| FIT-SLAM -- Fisher Information and Traversability estimation-based Active SLAM for exploration in 3D environments | | 0 |
| f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences | | 0 |
| Fractional Langevin Monte Carlo: Exploring Levy Driven Stochastic Differential Equations for MCMC | | 0 |
| Fractional Langevin Monte Carlo: Exploring Lévy Driven Stochastic Differential Equations for Markov Chain Monte Carlo | | 0 |
| FragFM: Hierarchical Framework for Efficient Molecule Generation via Fragment-Level Discrete Flow Matching | | 0 |
| From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making | | 0 |
| From proprioception to long-horizon planning in novel environments: A hierarchical RL model | | 0 |
| GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning | | 0 |
| GFlowNets for AI-Driven Scientific Discovery | | 0 |

No leaderboard results yet.