SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 276300 of 514 papers

TitleStatusHype
Sample Efficient Robot Learning in Supervised Effect Prediction Tasks0
Sampling for Model Predictive Trajectory Planning in Autonomous Driving using Normalizing Flows0
SAR Image Despeckling Based on Convolutional Denoising Autoencoder0
Scalable Exploration for Neural Online Learning to Rank with Perturbed Feedback0
Scaling active inference0
Scattered Forest Search: Smarter Code Space Exploration with LLMs0
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning0
Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning0
SEMI: Self-supervised Exploration via Multisensory Incongruity0
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models0
SHIRO: Soft Hierarchical Reinforcement Learning0
SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks0
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution0
Sparse graphs using exchangeable random measures0
Misspecification-robust likelihood-free inference in high dimensions0
n-Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank0
Structured exploration in the finite horizon linear quadratic dual control problem0
Successor-Predecessor Intrinsic Exploration0
Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery0
TANDEM: Learning Joint Exploration and Decision Making with Tactile Sensors0
Targeting the partition function of chemically disordered materials with a generative approach based on inverse variational autoencoders0
Task-agnostic Exploration in Reinforcement Learning0
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning0
The Eigenoption-Critic Framework0
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors0
Show:102550
← PrevPage 12 of 21Next →

No leaderboard results yet.