SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 201225 of 514 papers

TitleStatusHype
Go-Explore for Residential Energy Management0
Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Case Study0
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving0
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration0
TransNAS-TSAD: Harnessing Transformers for Multi-Objective Neural Architecture Search in Time Series Anomaly DetectionCode0
Joint channel estimation and data detection in massive MIMO systems based on diffusion models0
Consensus-based adaptive sampling and approximation for high-dimensional energy landscapesCode0
Virtual Action Actor-Critic Framework for Exploration (Student Abstract)0
Regret Analysis of Learning-Based Linear Quadratic Gaussian Control with Additive Exploration0
Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning0
Efficient Exploration in Continuous-time Model-based Reinforcement Learning0
Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion0
Co-NavGPT: Multi-Robot Cooperative Visual Semantic Navigation Using Vision Language Models0
f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences0
Information Content Exploration0
Learning Optimal Power Flow Value Functions with Input-Convex Neural Networks0
Feature Interaction Aware Automated Data Representation TransformationCode0
DREAM: Decentralized Reinforcement Learning for Exploration and Efficient Energy Management in Multi-Robot Systems0
Provably Efficient Exploration in Constrained Reinforcement Learning:Posterior Sampling Is All You Need0
Curiosity as a Self-Supervised Method to Improve Exploration in De novo Drug DesignCode0
Learning Spatial and Temporal Hierarchies: Hierarchical Active Inference for navigation in Multi-Room Maze Environments0
Where2Explore: Few-shot Affordance Learning for Unseen Novel Categories of Articulated Objects0
Go Beyond Imagination: Maximizing Episodic Reachability with World ModelsCode0
Reinforcement learning informed evolutionary search for autonomous systems testing0
Bag of Policies for Distributional Deep Exploration0
Show:102550
← PrevPage 9 of 21Next →

No leaderboard results yet.