SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 426450 of 514 papers

TitleStatusHype
Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization0
Efficient Exploration using Model-Based Quality-Diversity with Gradients0
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization0
Efficient exploration with Double Uncertain Value Networks0
Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards0
Efficient gPC-based quantification of probabilistic robustness for systems in neuroscience0
Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation0
Efficient Policy Space Response Oracles0
Efficient Pose and Cell Segmentation using Column Generation0
Reinforcement Learning for Causal Discovery without Acyclicity Constraints0
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization0
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling0
Embodied Agents for Efficient Exploration and Smart Scene Description0
Emotion-Agent: Unsupervised Deep Reinforcement Learning with Distribution-Prototype Reward for Continuous Emotional EEG Analysis0
End-Effect Exploration Drive for Effective Motor Learning0
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning0
Entropic Risk-Sensitive Reinforcement Learning: A Meta Regret Framework with Function Approximation0
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning0
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation0
Evolutionary Reinforcement Learning via Cooperative Coevolution0
Explicit Planning for Efficient Exploration in Reinforcement Learning0
Explicit Recall for Efficient Exploration0
Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes0
Exploration by Distributional Reinforcement Learning0
Exploration by Learning Diverse Skills through Successor State Measures0
Show:102550
← PrevPage 18 of 21Next →

No leaderboard results yet.