SOTAVerified

Efficient Exploration

Efficient exploration is one of the main obstacles to scaling up modern deep reinforcement learning algorithms. The central challenge is balancing the exploitation of current estimates against gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows
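
As a rough illustration of the trade-off described above, the sketch below uses the classic UCB1 rule on a multi-armed bandit. This is not the randomized value function method of the cited source; it is a simpler stand-in chosen because it makes both terms of the trade-off explicit. The function names `ucb1_select` and `run_bandit`, the bonus constant `c`, and the Bernoulli reward model are all assumptions made for this example.

```python
import math
import random


def ucb1_select(counts, values, t, c=2.0):
    """Return the index of the arm with the highest optimistic score.

    counts[a] is how often arm a was pulled, values[a] its mean reward
    so far, and t the current (1-based) time step. All names here are
    hypothetical and exist only for this illustration.
    """
    # Pull every arm once first, so each has at least one observation.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    # Exploitation term: the current mean estimate.
    # Exploration term: a bonus that shrinks as an arm is pulled more often.
    scores = [
        values[a] + math.sqrt(c * math.log(t) / counts[a])
        for a in range(len(counts))
    ]
    return max(range(len(counts)), key=scores.__getitem__)


def run_bandit(true_means, steps=2000, seed=0):
    """Simulate UCB1 on Bernoulli arms (an assumed toy reward model)."""
    rng = random.Random(seed)
    k = len(true_means)
    counts = [0] * k
    values = [0.0] * k
    for t in range(1, steps + 1):
        arm = ucb1_select(counts, values, t)
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the running mean for the pulled arm.
        values[arm] += (reward - values[arm]) / counts[arm]
    return counts, values


if __name__ == "__main__":
    counts, values = run_bandit([0.2, 0.5, 0.8])
    print("pulls per arm:", counts)
    print("estimated means:", [round(v, 3) for v in values])
```

An arm that has rarely been tried gets a large bonus even when its current estimate is low, so the rule keeps gathering information about poorly understood actions while mostly exploiting the best-looking one. Deep reinforcement learning methods face the same tension over much larger state and action spaces, where simple visit counts are typically not available.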

Papers

Showing 201-250 of 514 papers

| Title | Status | Hype |
| --- | --- | --- |
| DISCO-10M: A Large-Scale Music Dataset | | 0 |
| Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion | | 0 |
| Directed Exploration in PAC Model-Free Reinforcement Learning | | 0 |
| Biased Estimates of Advantages over Path Ensembles | | 0 |
| A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning | | 0 |
| Directed Exploration for Reinforcement Learning | | 0 |
| Go-Browse: Training Web Agents with Structured Exploration | | 0 |
| Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning | | 0 |
| Diffusion Models Meet Contextual Bandits with Large Action Spaces | | 0 |
| Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving | | 0 |
| Beyond Games: Bringing Exploration to Robots in Real-world | | 0 |
| Approximate information for efficient exploration-exploitation strategies | | 0 |
| GFlowNets for AI-Driven Scientific Discovery | | 0 |
| Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning | | 0 |
| DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models | | 0 |
| Differentially Evolving Memory Ensembles: Pareto Optimization based on Computational Intelligence for Embedded Memories on a System Level | | 0 |
| Better Exploration with Optimistic Actor-Critic | | 0 |
| An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems | | 0 |
| A Compression-Inspired Framework for Macro Discovery | | 0 |
| GAN-based Intrinsic Exploration For Sample Efficient Reinforcement Learning | | 0 |
| From proprioception to long-horizon planning in novel environments: A hierarchical RL model | | 0 |
| Deterministic Sequencing of Exploration and Exploitation for Reinforcement Learning | | 0 |
| From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making | | 0 |
| FragFM: Hierarchical Framework for Efficient Molecule Generation via Fragment-Level Discrete Flow Matching | | 0 |
| Design of Convolutional Extreme Learning Machines for Vision-Based Navigation Around Small Bodies | | 0 |
| Fractional Langevin Monte Carlo: Exploring Lévy Driven Stochastic Differential Equations for Markov Chain Monte Carlo | | 0 |
| Fractional Langevin Monte Carlo: Exploring Levy Driven Stochastic Differential Equations for MCMC | | 0 |
| f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences | | 0 |
| Deep Neural Linear Bandits: Overcoming Catastrophic Forgetting through Likelihood Matching | | 0 |
| GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices | | 0 |
| FIT-SLAM -- Fisher Information and Traversability estimation-based Active SLAM for exploration in 3D environments | | 0 |
| DEEPGONET: Multi-label Prediction of GO Annotation for Protein from Sequence Using Cascaded Convolutional and Recurrent Network | | 0 |
| Goal-oriented Trajectories for Efficient Exploration | | 0 |
| Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts | | 0 |
| Finding Waldo: Towards Efficient Exploration of NeRF Scene Spaces | | 0 |
| Deep Exploration via Randomized Value Functions | | 0 |
| Go-Explore for Residential Energy Management | | 0 |
| GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering | | 0 |
| β-DQN: Improving Deep Q-Learning By Evolving the Behavior | | 0 |
| Guarded Policy Optimization with Imperfect Online Demonstrations | | 0 |
| Guided Exploration for Efficient Relational Model Learning | | 0 |
| Hands-Free Segmentation of Medical Volumes via Binary Inputs | | 0 |
| Feature Engineering for Predictive Modeling using Reinforcement Learning | | 0 |
| Deep exploration by novelty-pursuit with maximum state entropy | | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | | 0 |
| An Explainable Nature-Inspired Framework for Monkeypox Diagnosis: Xception Features Combined with NGBoost and African Vultures Optimization Algorithm | | 0 |
| Adaptive Exploration for Multi-Reward Multi-Policy Evaluation | | 0 |
| Feature and Instance Joint Selection: A Reinforcement Learning Perspective | | 0 |
| Fast exploration and learning of latent graphs with aliased observations | | 0 |
| FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs | | 0 |

No leaderboard results yet.