SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 51100 of 514 papers

TitleStatusHype
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic LearningCode1
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte CarloCode1
Learning Exploration Policies for NavigationCode1
Hybrid Genetic Search for the CVRP: Open-Source Implementation and SWAP* NeighborhoodCode1
NovelD: A Simple yet Effective Exploration CriterionCode1
Contextualizing biological perturbation experiments through languageCode1
GeoThermalCloud: Machine Learning for Geothermal Resource ExplorationCode1
HyperDQN: A Randomized Exploration Method for Deep Reinforcement LearningCode1
A Survey of Label-Efficient Deep Learning for 3D Point CloudsCode1
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven ExplorationCode1
A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules?Code1
Latent World Models For Intrinsically Motivated ExplorationCode1
Evolutionary Large Language Model for Automated Feature TransformationCode1
Generative Colorization of Structured Mobile Web PagesCode1
SC-Explorer: Incremental 3D Scene Completion for Safe and Efficient Exploration Mapping and PlanningCode1
Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid LocomotionCode1
Adversarially Guided Actor-CriticCode1
MADE: Exploration via Maximizing Deviation from Explored RegionsCode1
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning0
BooVI: Provably Efficient Bootstrapped Value Iteration0
MAGMA: An Optimization Framework for Mapping Multiple DNNs on Multiple Accelerator Cores0
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration0
Biased Estimates of Advantages over Path Ensembles0
KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance0
DREAM: Decentralized Reinforcement Learning for Exploration and Efficient Energy Management in Multi-Robot Systems0
Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning0
Beyond Games: Bringing Exploration to Robots in Real-world0
Approximate information for efficient exploration-exploitation strategies0
An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems0
Better Exploration with Optimistic Actor-Critic0
A Compression-Inspired Framework for Macro Discovery0
Divide and Explore: Multi-Agent Separate Exploration with Shared Intrinsic Motivations0
DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience0
Adaptive Exploration for Multi-Reward Multi-Policy Evaluation0
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning0
β-DQN: Improving Deep Q-Learning By Evolving the Behavior0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems0
An Explainable Nature-Inspired Framework for Monkeypox Diagnosis: Xception Features Combined with NGBoost and African Vultures Optimization Algorithm0
Distributional Reinforcement Learning for Efficient Exploration0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems0
Bayesian optimization of distributed neurodynamical controller models for spatial navigation0
Adaptformer: Sequence models as adaptive iterative planners0
Data-Efficient Exploration with Self Play for Atari0
Bayesian optimisation of large-scale photonic reservoir computers0
A Natural Extension To Online Algorithms For Hybrid RL With Limited Coverage0
CURO: Curriculum Learning for Relative Overgeneralization0
A Community Based Algorithm for Large Scale Web Service Composition0
Deep Active Ensemble Sampling For Image Classification0
Discovering Context Specific Causal Relationships0
Distilling Realizable Students from Unrealizable Teachers0
Show:102550
← PrevPage 2 of 11Next →

No leaderboard results yet.