SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 251300 of 514 papers

TitleStatusHype
Opinion-Guided Reinforcement Learning0
Optimistic Exploration with Backward Bootstrapped Bonus for Deep Reinforcement Learning0
Optimization by Pairwise Linkage Detection, Incremental Linkage Set, and Restricted / Back Mixing: DSMGA-II0
Optimizing Routerless Network-on-Chip Designs: An Innovative Learning-Based Framework0
Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL0
PACER: A Fully Push-forward-based Distributional Reinforcement Learning Algorithm0
ParamsDrag: Interactive Parameter Space Exploration via Image-Space Dragging0
Particle Filter Based Monocular Human Tracking with a 3D Cardbox Model and a Novel Deterministic Resampling Strategy0
PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning0
Planning to the Information Horizon of BAMDPs via Epistemic State Abstraction0
PlotThread: Creating Expressive Storyline Visualizations using Reinforcement Learning0
Policy Mirror Descent Inherently Explores Action Space0
Probabilistic Insights for Efficient Exploration Strategies in Reinforcement Learning0
Processing Document Collections to Automatically Extract Linked Data: Semantic Storytelling Technologies for Smart Curation Workflows0
Protein design by multiobjective optimization: evolutionary and non-evolutionary approaches0
Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need0
Provably Efficient Exploration in Inverse Constrained Reinforcement Learning0
Provably Efficient Exploration in Policy Optimization0
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret0
Provably Efficient Exploration in Reward Machines with Low Regret0
Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP0
QueryBuilder: Human-in-the-Loop Query Development for Information Retrieval0
Randomized-Grid Search for Hyperparameter Tuning in Decision Tree Model to Improve Performance of Cardiovascular Disease Classification0
Deep Learning based Uncertainty Decomposition for Real-time Control0
Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning0
Regret Analysis of Learning-Based Linear Quadratic Gaussian Control with Additive Exploration0
Regulatory Focus: Promotion and Prevention Inclinations in Policy Search0
Reinforced dynamics for enhanced sampling in large atomic and molecular systems0
Reinforcement learning informed evolutionary search for autonomous systems testing0
Reinforcement Learning in Reward-Mixing MDPs0
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization0
Robotic Grasping of Fully-Occluded Objects using RF Perception0
Dimension-Robust MCMC in Bayesian Inverse Problems0
Safe Exploration of State and Action Spaces in Reinforcement Learning0
Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions0
Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time0
Sample-Efficient, Exploration-Based Policy Optimisation for Routing Problems0
Sample Efficient Robot Learning in Supervised Effect Prediction Tasks0
Sampling for Model Predictive Trajectory Planning in Autonomous Driving using Normalizing Flows0
SAR Image Despeckling Based on Convolutional Denoising Autoencoder0
Scalable Exploration for Neural Online Learning to Rank with Perturbed Feedback0
Scaling active inference0
Scattered Forest Search: Smarter Code Space Exploration with LLMs0
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning0
Self-supervised Sequential Information Bottleneck for Robust Exploration in Deep Reinforcement Learning0
SEMI: Self-supervised Exploration via Multisensory Incongruity0
Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models0
SHIRO: Soft Hierarchical Reinforcement Learning0
SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks0
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution0
Show:102550
← PrevPage 6 of 11Next →

No leaderboard results yet.