SOTAVerified

Efficient Exploration

Efficient exploration is one of the main obstacles to scaling up modern deep reinforcement learning algorithms. The central challenge is balancing the exploitation of current estimates against gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
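The trade-off described above can be illustrated with a minimal sketch. It assumes a Bernoulli multi-armed bandit and the standard UCB1 rule rather than any method from the papers listed on this page; the names `ucb1_select` and `run_bandit` are hypothetical. Each arm's score is its current mean estimate (exploitation) plus a bonus that shrinks as the arm is tried more often (information gain).

```python
import math
import random


def ucb1_select(counts, values, t):
    """Pick an arm: any untried arm first, else the highest estimate plus bonus."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm  # an arm with no data is maximally uncertain
    scores = [
        values[a] + math.sqrt(2.0 * math.log(t) / counts[a])
        for a in range(len(counts))
    ]
    return max(range(len(counts)), key=scores.__getitem__)


def run_bandit(true_means, steps, seed=0):
    """Simulate UCB1 on a Bernoulli bandit (true_means are assumed, illustrative)."""
    rng = random.Random(seed)
    counts = [0] * len(true_means)    # number of pulls per arm
    values = [0.0] * len(true_means)  # running mean reward per arm
    for t in range(1, steps + 1):
        arm = ucb1_select(counts, values, t)
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the running mean for the pulled arm.
        values[arm] += (reward - values[arm]) / counts[arm]
    return counts, values


if __name__ == "__main__":
    counts, values = run_bandit([0.2, 0.5, 0.8], steps=2000)
    print("pulls per arm:", counts)
    print("estimated means:", [round(v, 3) for v in values])
```

In this sketch an arm that has rarely been pulled keeps a large bonus, so it is revisited even when its estimate is low. Deep reinforcement learning methods typically replace the visit-count bonus with learned uncertainty or novelty estimates.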

Papers

Showing 351–400 of 514 papers

| Title | Status | Hype |
| --- | --- | --- |
| Intrinsically Guided Exploration in Meta Reinforcement Learning | | 0 |
| Online Limited Memory Neural-Linear Bandits | | 0 |
| Robotic Grasping of Fully-Occluded Objects using RF Perception | | 0 |
| Curiosity in exploring chemical space: Intrinsic rewards for deep molecular reinforcement learning | | 0 |
| SAR Image Despeckling Based on Convolutional Denoising Autoencoder | | 0 |
| Model-based Reinforcement Learning for Continuous Control with Posterior Sampling | Code | 0 |
| Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization | | 0 |
| A New Bandit Setting Balancing Information from State Evolution and Corrupted Context | Code | 0 |
| Hierarchical reinforcement learning for efficient exploration and transfer | | 0 |
| Amortized Variational Deep Q Network | Code | 0 |
| Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning | | 0 |
| Efficient Robotic Object Search via HIEM: Hierarchical Policy Learning with Intrinsic-Extrinsic Modeling | | 0 |
| Deep Learning based Uncertainty Decomposition for Real-time Control | | 0 |
| Efficient, Decentralized, and Collaborative Multi-Robot Exploration using Optimal Transport Theory | | 0 |
| Efficient Exploration for Model-based Reinforcement Learning with Continuous States and Actions | | 0 |
| SEMI: Self-supervised Exploration via Multisensory Incongruity | | 0 |
| PlotThread: Creating Expressive Storyline Visualizations using Reinforcement Learning | | 0 |
| Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads | | 0 |
| Hierarchically Organized Latent Modules for Exploratory Search in Morphogenetic Systems | Code | 0 |
| End-Effect Exploration Drive for Effective Motor Learning | | 0 |
| Task-agnostic Exploration in Reinforcement Learning | | 0 |
| From proprioception to long-horizon planning in novel environments: A hierarchical RL model | | 0 |
| Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration | Code | 0 |
| Multirobot Coverage of Modular Environments | Code | 0 |
| PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning | | 0 |
| Bayesian optimisation of large-scale photonic reservoir computers | | 0 |
| Weakly-Supervised Reinforcement Learning for Controllable Behavior | | 0 |
| Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning | Code | 0 |
| Active Model Estimation in Markov Decision Processes | | 0 |
| Efficient Exploration in Constrained Environments with Goal-Oriented Reference Path | | 0 |
| Efficient exploration of zero-sum stochastic games | | 0 |
| Particle Filter Based Monocular Human Tracking with a 3D Cardbox Model and a Novel Deterministic Resampling Strategy | | 0 |
| Misspecification-robust likelihood-free inference in high dimensions | | 0 |
| Minimax Value Interval for Off-Policy Evaluation and Policy Optimization | | 0 |
| GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal Babbling | Code | 0 |
| Parameterized Indexed Value Function for Efficient Exploration in Reinforcement Learning | Code | 0 |
| Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning | | 0 |
| Provably Efficient Exploration in Policy Optimization | | 0 |
| Explicit Planning for Efficient Exploration in Reinforcement Learning | | 0 |
| Better Exploration with Optimistic Actor Critic | Code | 0 |
| Comprehensive decision-strategy space exploration for efficient territorial planning strategies | | 0 |
| Scaling active inference | | 0 |
| Bayesian Curiosity for Efficient Exploration in Reinforcement Learning | Code | 0 |
| Implicit Generative Modeling for Efficient Exploration | | 0 |
| Efficient Exploration through Intrinsic Motivation Learning for Unsupervised Subgoal Discovery in Model-Free Hierarchical Reinforcement Learning | | 0 |
| Multi-Path Policy Optimization | | 0 |
| MAME : Model-Agnostic Meta-Exploration | | 0 |
| Neural Contextual Bandits with UCB-based Exploration | Code | 0 |
| Structured exploration in the finite horizon linear quadratic dual control problem | | 0 |
| VASE: Variational Assorted Surprise Exploration for Reinforcement Learning | | 0 |

No leaderboard results yet.