SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 376400 of 514 papers

TitleStatusHype
Contextual Decision Processes with Low Bellman Rank are PAC-Learnable0
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model0
Bridging Text and Crystal Structures: Literature-driven Contrastive Learning for Materials Science0
Credit Assignment and Efficient Exploration based on Influence Scope in Multi-agent Reinforcement Learning0
Curiosity in exploring chemical space: Intrinsic rewards for deep molecular reinforcement learning0
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation0
CURO: Curriculum Learning for Relative Overgeneralization0
Data-Efficient Exploration with Self Play for Atari0
Deep Active Ensemble Sampling For Image Classification0
Deep density networks and uncertainty in recommender systems0
Deep exploration by novelty-pursuit with maximum state entropy0
Deep Exploration via Randomized Value Functions0
DEEPGONET: Multi-label Prediction of GO Annotation for Protein from Sequence Using Cascaded Convolutional and Recurrent Network0
Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching0
Design of Convolutional Extreme Learning Machines for Vision-Based Navigation Around Small Bodies0
Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning0
Differentially Evolving Memory Ensembles: Pareto Optimization based on Computational Intelligence for Embedded Memories on a System Level0
DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models0
Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning0
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving0
Diffusion Models Meet Contextual Bandits with Large Action Spaces0
Directed Exploration for Reinforcement Learning0
Directed Exploration in PAC Model-Free Reinforcement Learning0
DISCO-10M: A Large-Scale Music Dataset0
Discovering Context Specific Causal Relationships0
Show:102550
← PrevPage 16 of 21Next →

No leaderboard results yet.