Efficient Exploration

Efficient Exploration is one of the main obstacles to scaling up modern deep reinforcement learning algorithms. The main challenge is balancing the exploitation of current estimates against gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
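
The trade-off described above can be illustrated with a minimal sketch. It assumes a three-armed Bernoulli bandit and an epsilon-greedy rule, neither of which comes from the source paper (which concerns randomized value functions). The function name, parameters, and arm probabilities are hypothetical and chosen only for illustration.

```python
import random


def epsilon_greedy_bandit(true_means, steps=2000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on a Bernoulli bandit (illustrative assumption).

    Returns (estimates, counts): the running mean reward and the number
    of pulls for each arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms
    estimates = [0.0] * n_arms

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: gather information about a randomly chosen arm.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate
            # (ties resolve to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Sample a 0/1 reward from the chosen arm.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running mean for that arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


# Hypothetical arm success probabilities.
estimates, counts = epsilon_greedy_bandit([0.2, 0.5, 0.8])
print("estimates:", [round(e, 2) for e in estimates])
print("pulls per arm:", counts)
```

With `epsilon=0` this sketch only ever pulls the first arm, because it never gathers information about the others. With `epsilon=1` it samples all arms uniformly and never uses what it has learned. Intermediate values trade one off against the other.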

Papers

Showing 101–125 of 514 papers

Title | Status | Hype
Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation | - | 0
FragFM: Hierarchical Framework for Efficient Molecule Generation via Fragment-Level Discrete Flow Matching | - | 0
DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models | - | 0
Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts | - | 0
Massively Scaling Explicit Policy-conditioned Value Functions | - | 0
Causal Information Prioritization for Efficient Reinforcement Learning | - | 0
Exploratory Diffusion Model for Unsupervised Reinforcement Learning | - | 0
Guided Exploration for Efficient Relational Model Learning | - | 0
Few-shot_LLM_Synthetic_Data_with_Distribution_Matching | Code | 0
Adaptive Exploration for Multi-Reward Multi-Policy Evaluation | - | 0
Constrained Hybrid Metaheuristic Algorithm for Probabilistic Neural Networks Learning | - | 0
Mapping Galaxy Images Across Ultraviolet, Visible and Infrared Bands Using Generative Deep Learning | Code | 0
Multi-Objective Hyperparameter Selection via Hypothesis Testing on Reliability Graphs | Code | 0
Bridging Text and Crystal Structures: Literature-driven Contrastive Learning for Materials Science | - | 0
ActiveGAMER: Active GAussian Mapping through Efficient Rendering | - | 0
β-DQN: Improving Deep Q-Learning By Evolving the Behavior | - | 0
Provably Efficient Exploration in Reward Machines with Low Regret | - | 0
A diversity-enhanced genetic algorithm for efficient exploration of parameter spaces | Code | 0
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering | - | 0
GenPlan: Generative Sequence Models as Adaptive Planners | Code | 0
A Temporally Correlated Latent Exploration for Reinforcement Learning | - | 0
Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning | - | 0
Sample Efficient Robot Learning in Supervised Effect Prediction Tasks | - | 0
CBOL-Tuner: Classifier-pruned Bayesian optimization to explore temporally structured latent spaces for particle accelerator tuning | - | 0
Adaptformer: Sequence models as adaptive iterative planners | - | 0

No leaderboard results yet.