SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 151175 of 514 papers

TitleStatusHype
TopoNav: Topological Navigation for Efficient Exploration in Sparse Reward Environments0
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgentCode0
Efficient Exploration for LLMs0
Scheduled Curiosity-Deep Dyna-Q: Efficient Exploration for Dialog Policy Learning0
Layered and Staged Monte Carlo Tree Search for SMT Strategy SynthesisCode1
FIT-SLAM -- Fisher Information and Traversability estimation-based Active SLAM for exploration in 3D environments0
Go-Explore for Residential Energy Management0
Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Case Study0
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving0
A Bayesian Framework of Deep Reinforcement Learning for Joint O-RAN/MEC Orchestration0
TransNAS-TSAD: Harnessing Transformers for Multi-Objective Neural Architecture Search in Time Series Anomaly DetectionCode0
Joint channel estimation and data detection in massive MIMO systems based on diffusion models0
Consensus-based adaptive sampling and approximation for high-dimensional energy landscapesCode0
Virtual Action Actor-Critic Framework for Exploration (Student Abstract)0
Regret Analysis of Learning-Based Linear Quadratic Gaussian Control with Additive Exploration0
Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning0
Efficient Exploration in Continuous-time Model-based Reinforcement Learning0
Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion0
Co-NavGPT: Multi-Robot Cooperative Visual Semantic Navigation Using Vision Language Models0
Information Content Exploration0
f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences0
Learning Optimal Power Flow Value Functions with Input-Convex Neural Networks0
PGDQN: Preference-Guided Deep Q-NetworkCode1
Feature Interaction Aware Automated Data Representation TransformationCode0
DREAM: Decentralized Reinforcement Learning for Exploration and Efficient Energy Management in Multi-Robot Systems0
Show:102550
← PrevPage 7 of 21Next →

No leaderboard results yet.