SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 110 of 514 papers

TitleStatusHype
MOORL: A Framework for Integrating Offline-Online Reinforcement Learning0
Go-Browse: Training Web Agents with Structured Exploration0
DrSR: LLM based Scientific Equation Discovery with Dual Reasoning from Data and Experience0
WoMAP: World Models For Embodied Open-Vocabulary Object Localization0
MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary ProgrammingCode2
HelixDesign-Binder: A Scalable Production-Grade Platform for Binder Design Built on HelixFold30
DISCOVER: Automated Curricula for Sparse-Reward Reinforcement LearningCode0
STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMsCode0
Comparative Analysis of Black-Box Optimization Methods for Weather Intervention Design0
IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-TuningCode0
Show:102550
← PrevPage 1 of 52Next →

No leaderboard results yet.