SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 125 of 514 papers

TitleStatusHype
Streamlining Ocean Dynamics Modeling with Fourier Neural Operators: A Multiobjective Hyperparameter and Architecture Optimization ApproachCode7
Cradle: Empowering Foundation Agents Towards General Computer ControlCode7
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical ReasoningCode5
MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary ProgrammingCode2
ForesightNav: Learning Scene Imagination for Efficient ExplorationCode2
GenNBV: Generalizable Next-Best-View Policy for Active 3D ReconstructionCode2
Iterated Denoising Energy Matching for Sampling from Boltzmann DensitiesCode2
Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction FollowingCode2
LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and CosmologyCode2
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical RobotCode2
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language NavigationCode2
Online Decision TransformerCode2
Contextualizing biological perturbation experiments through languageCode1
Training a Generally Curious AgentCode1
Maximum Entropy Reinforcement Learning with Diffusion PolicyCode1
GNN-DT: Graph Neural Network Enhanced Decision Transformer for Efficient Optimization in Dynamic EnvironmentsCode1
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic LearningCode1
Leveraging Skills from Unlabeled Prior Data for Efficient Online ExplorationCode1
Persistent Sampling: Enhancing the Efficiency of Sequential Monte CarloCode1
Exciting Action: Investigating Efficient Exploration for Learning Musculoskeletal Humanoid LocomotionCode1
Evolutionary Large Language Model for Automated Feature TransformationCode1
Navigating Chemical Space with Latent FlowsCode1
Diffusion-Reinforcement Learning Hierarchical Motion Planning in Multi-agent Adversarial GamesCode1
MAMBA: an Effective World Model Approach for Meta-Reinforcement LearningCode1
Safe Guaranteed Exploration for Non-linear SystemsCode1
Show:102550
← PrevPage 1 of 21Next →

No leaderboard results yet.