SOTAVerified

Efficient Exploration

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Papers

Showing 2650 of 514 papers

TitleStatusHype
Disentangling Uncertainties by Learning Compressed Data RepresentationCode0
Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model0
HyperArm Bandit Optimization: A Novel approach to Hyperparameter Optimization and an Analysis of Bandit Algorithms in Stochastic and Adversarial Settings0
Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration0
Reward-Centered ReST-MCTS: A Robust Decision-Making Framework for Robotic Manipulation in High Uncertainty EnvironmentsCode0
Probabilistic Insights for Efficient Exploration Strategies in Reinforcement Learning0
A Transformer Model for Predicting Chemical Reaction Products from Generic Templates0
Contextualizing biological perturbation experiments through languageCode1
Training a Generally Curious AgentCode1
On Space-Filling Input Design for Nonlinear Dynamic Model Learning: A Gaussian Process Approach0
Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery0
Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation0
FragFM: Hierarchical Framework for Efficient Molecule Generation via Fragment-Level Discrete Flow Matching0
DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models0
Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts0
Maximum Entropy Reinforcement Learning with Diffusion PolicyCode1
Massively Scaling Explicit Policy-conditioned Value Functions0
Causal Information Prioritization for Efficient Reinforcement Learning0
Exploratory Diffusion Model for Unsupervised Reinforcement Learning0
Guided Exploration for Efficient Relational Model Learning0
Few-shot_LLM_Synthetic_Data_with_Distribution_MatchingCode0
Adaptive Exploration for Multi-Reward Multi-Policy Evaluation0
GNN-DT: Graph Neural Network Enhanced Decision Transformer for Efficient Optimization in Dynamic EnvironmentsCode1
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic LearningCode1
Constrained Hybrid Metaheuristic Algorithm for Probabilistic Neural Networks Learning0
Show:102550
← PrevPage 2 of 21Next →

No leaderboard results yet.