SOTAVerified

Efficient Exploration

Efficient exploration is one of the main obstacles to scaling up modern deep reinforcement learning algorithms. The main challenge is balancing the exploitation of current estimates against gathering information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
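
The trade-off described above can be illustrated with a minimal sketch on a multi-armed bandit. It uses the classical UCB1 rule as a stand-in, not the method of the source paper: each arm's score is its current mean estimate (exploitation) plus a bonus that shrinks as the arm is tried more often (information gain). The names `ucb1_select` and `run_bandit`, the Bernoulli rewards, and the constant `c=2.0` are assumptions made for illustration.

```python
import math
import random


def ucb1_select(counts, values, t, c=2.0):
    """Pick an arm: try every untried arm once, then maximize mean + bonus."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm  # no estimate yet, so explore this arm first
    scores = [
        # current estimate (exploit) + uncertainty bonus (explore)
        values[arm] + math.sqrt(c * math.log(t) / counts[arm])
        for arm in range(len(counts))
    ]
    return max(range(len(scores)), key=scores.__getitem__)


def run_bandit(true_means, steps=2000, seed=0):
    """Simulate Bernoulli arms (hypothetical environment) under UCB1."""
    rng = random.Random(seed)
    counts = [0] * len(true_means)    # pulls per arm
    values = [0.0] * len(true_means)  # running mean reward per arm
    for t in range(1, steps + 1):
        arm = ucb1_select(counts, values, t)
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # incremental update of the running mean
        values[arm] += (reward - values[arm]) / counts[arm]
    return counts, values


if __name__ == "__main__":
    counts, values = run_bandit([0.2, 0.5, 0.8])
    print("pulls per arm:", counts)
    print("estimated means:", [round(v, 3) for v in values])
```

Rarely tried arms keep a large bonus, so they are revisited until their estimates are trustworthy; well-understood arms are then chosen mainly on their estimated value.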

Papers

Showing 76-100 of 514 papers

Title | Status | Hype
Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning | | 0
Beyond Games: Bringing Exploration to Robots in Real-world | | 0
Approximate information for efficient exploration-exploitation strategies | | 0
An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems | | 0
Better Exploration with Optimistic Actor-Critic | | 0
A Compression-Inspired Framework for Macro Discovery | | 0
Divide and Explore: Multi-Agent Separate Exploration with Shared Intrinsic Motivations | | 0
DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience | | 0
Adaptive Exploration for Multi-Reward Multi-Policy Evaluation | | 0
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning | | 0
β-DQN: Improving Deep Q-Learning By Evolving the Behavior | | 0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | | 0
An Explainable Nature-Inspired Framework for Monkeypox Diagnosis: Xception Features Combined with NGBoost and African Vultures Optimization Algorithm | | 0
Distributional Reinforcement Learning for Efficient Exploration | | 0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | | 0
Bayesian optimization of distributed neurodynamical controller models for spatial navigation | | 0
Adaptformer: Sequence models as adaptive iterative planners | | 0
Data-Efficient Exploration with Self Play for Atari | | 0
Bayesian optimisation of large-scale photonic reservoir computers | | 0
A Natural Extension To Online Algorithms For Hybrid RL With Limited Coverage | | 0
CURO: Curriculum Learning for Relative Overgeneralization | | 0
A Community Based Algorithm for Large Scale Web Service Composition | | 0
Deep Active Ensemble Sampling For Image Classification | | 0
Discovering Context Specific Causal Relationships | | 0
Distilling Realizable Students from Unrealizable Teachers | | 0
Page 4 of 21

No leaderboard results yet.