SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 126150 of 514 papers

TitleStatusHype
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning0
Curiosity in exploring chemical space: Intrinsic rewards for deep molecular reinforcement learning0
ActSafe: Active Exploration with Safety Constraints for Reinforcement Learning0
Emotion-Agent: Unsupervised Deep Reinforcement Learning with Distribution-Prototype Reward for Continuous Emotional EEG Analysis0
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning0
Bandit Algorithms for Tree Search0
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning0
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling0
Reinforcement Learning for Causal Discovery without Acyclicity Constraints0
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization0
Embodied Agents for Efficient Exploration and Smart Scene Description0
Entropic Risk-Sensitive Reinforcement Learning: A Meta Regret Framework with Function Approximation0
Bag of Policies for Distributional Deep Exploration0
Bridging Text and Crystal Structures: Literature-driven Contrastive Learning for Materials Science0
A Human Mixed Strategy Approach to Deep Reinforcement Learning0
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model0
A Web-scale system for scientific knowledge exploration0
Efficient Policy Space Response Oracles0
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable0
Context-Dependent Upper-Confidence Bounds for Directed Exploration0
Active Model Estimation in Markov Decision Processes0
Constrained Hybrid Metaheuristic Algorithm for Probabilistic Neural Networks Learning0
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation0
ACE : Off-Policy Actor-Critic with Causality-Aware Entropy Regularization0
Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation0
Show:102550
← PrevPage 6 of 21Next →

No leaderboard results yet.