SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 101125 of 514 papers

TitleStatusHype
Efficient Exploration of Gradient Space for Online Learning to Rank0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems0
Deep exploration by novelty-pursuit with maximum state entropy0
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems0
Deep Exploration via Randomized Value Functions0
DEEPGONET: Multi-label Prediction of GO Annotation for Protein from Sequence Using Cascaded Convolutional and Recurrent Network0
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models0
Efficient Exploration Using Extra Safety Budget in Constrained Policy Optimization0
Design of Convolutional Extreme Learning Machines for Vision-Based Navigation Around Small Bodies0
Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning0
Differentially Evolving Memory Ensembles: Pareto Optimization based on Computational Intelligence for Embedded Memories on a System Level0
DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models0
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning0
Efficient Pose and Cell Segmentation using Column Generation0
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving0
Diffusion Models Meet Contextual Bandits with Large Action Spaces0
BayesCNS: A Unified Bayesian Approach to Address Cold Start and Non-Stationarity in Search Systems at Scale0
Directed Exploration for Reinforcement Learning0
Directed Exploration in PAC Model-Free Reinforcement Learning0
DISCO-10M: A Large-Scale Music Dataset0
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration0
KOI: Accelerating Online Imitation Learning via Hybrid Key-state Guidance0
Discovering Context Specific Causal Relationships0
BooVI: Provably Efficient Bootstrapped Value Iteration0
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation0
Show:102550
← PrevPage 5 of 21Next →

No leaderboard results yet.