SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 401450 of 514 papers

TitleStatusHype
Distilling Realizable Students from Unrealizable Teachers0
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning0
Distributional Reinforcement Learning for Efficient Exploration0
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning0
Divide and Explore: Multi-Agent Separate Exploration with Shared Intrinsic Motivations0
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores0
DREAM: Decentralized Reinforcement Learning for Exploration and Efficient Energy Management in Multi-Robot Systems0
DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience0
Efficient, Decentralized, and Collaborative Multi-Robot Exploration using Optimal Transport Theory0
EfficientEQA: An Efficient Approach for Open Vocabulary Embodied Question Answering0
Efficient Exploration and Discriminative World Model Learning with an Object-Centric Abstraction0
Efficient Exploration and Value Function Generalization in Deterministic Systems0
Efficient Exploration for LLMs0
Efficient Exploration for Model-based Reinforcement Learning with Continuous States and Actions0
Efficient Exploration in Binary and Preferential Bayesian Optimization0
Efficient Exploration in Constrained Environments with Goal-Oriented Reference Path0
Efficient Exploration in Continuous-time Model-based Reinforcement Learning0
Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm0
Efficient Exploration in Resource-Restricted Reinforcement Learning0
Efficient Exploration of Gradient Space for Online Learning to Rank0
A Straightforward Gradient-Based Approach for High-Tc Superconductor Design: Leveraging Domain Knowledge via Adaptive Constraints0
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models0
Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization0
Efficient exploration of zero-sum stochastic games0
Efficient Exploration through Intrinsic Motivation Learning for Unsupervised Subgoal Discovery in Model-Free Hierarchical Reinforcement Learning0
Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization0
Efficient Exploration using Model-Based Quality-Diversity with Gradients0
Efficient Exploration via Epistemic-Risk-Seeking Policy Optimization0
Efficient exploration with Double Uncertain Value Networks0
Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards0
Efficient gPC-based quantification of probabilistic robustness for systems in neuroscience0
Efficient Informed Proposals for Discrete Distributions via Newton's Series Approximation0
Efficient Policy Space Response Oracles0
Efficient Pose and Cell Segmentation using Column Generation0
Reinforcement Learning for Causal Discovery without Acyclicity Constraints0
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization0
Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling0
Embodied Agents for Efficient Exploration and Smart Scene Description0
Emotion-Agent: Unsupervised Deep Reinforcement Learning with Distribution-Prototype Reward for Continuous Emotional EEG Analysis0
End-Effect Exploration Drive for Effective Motor Learning0
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning0
Entropic Risk-Sensitive Reinforcement Learning: A Meta Regret Framework with Function Approximation0
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning0
ESC: Exploration with Soft Commonsense Constraints for Zero-shot Object Navigation0
Evolutionary Reinforcement Learning via Cooperative Coevolution0
Explicit Planning for Efficient Exploration in Reinforcement Learning0
Explicit Recall for Efficient Exploration0
Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes0
Exploration by Distributional Reinforcement Learning0
Exploration by Learning Diverse Skills through Successor State Measures0
Show:102550
← PrevPage 9 of 11Next →

No leaderboard results yet.