SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 476500 of 514 papers

TitleStatusHype
GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices0
Goal-oriented Trajectories for Efficient Exploration0
Go-Browse: Training Web Agents with Structured Exploration0
Go-Explore for Residential Energy Management0
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering0
Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion0
Guarded Policy Optimization with Imperfect Online Demonstrations0
Guided Exploration for Efficient Relational Model Learning0
Hands-Free Segmentation of Medical Volumes via Binary Inputs0
Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning0
HEAT: Hardware-Efficient Automatic Tensor Decomposition for Transformer Compression0
HelixDesign-Binder: A Scalable Production-Grade Platform for Binder Design Built on HelixFold30
Hierarchical reinforcement learning for efficient exploration and transfer0
HyperArm Bandit Optimization: A Novel approach to Hyperparameter Optimization and an Analysis of Bandit Algorithms in Stochastic and Adversarial Settings0
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning0
IB-MVS: An Iterative Algorithm for Deep Multi-View Stereo based on Binary Decisions0
Image-Based Deep Reinforcement Learning with Intrinsically Motivated Stimuli: On the Execution of Complex Robotic Tasks0
Impact of detecting clinical trial elements in exploration of COVID-19 literature0
Implicit Generative Modeling for Efficient Exploration0
Improving a State-of-the-Art Heuristic for the Minimum Latency Problem with Data Mining0
Incentivizing Exploration with Selective Data Disclosure0
Inferring Hierarchical Structure in Multi-Room Maze Environments0
Information Content Exploration0
Interpretable SHAP-bounded Bayesian Optimization for Underwater Acoustic Metamaterial Coating Design0
Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search0
Show:102550
← PrevPage 20 of 21Next →

No leaderboard results yet.